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PREFACE 


The Conference on Optical Information Processing for Aerospace Applications was 
held on August 18-19, 1981, at Langley Research Center, Hampton, Virginia. The 
purpose of this conference was to review current research in optical processing and 
to determine its role in future aerospace systems. It is clear from these proceed- 
ings that optical processing offers significant potential in the fields of aircraft 
and spacecraft control, pattern recognition, and robotics. The development of opti- 
cal devices and components has demonstrated that system concepts can be implemented 
in practical aerospace configurations. 

This conference was sponsored by the Office of Aeronautics and Space Technology 
(OAST) of NASA Headquarters and managed by Langley Research Center. Special thanks 
are offered to Dr. Martin Sokoloski of OAST, Rudolfo Segura (technical chairman), and 
the many others who gave of their time and talents. 

Use of trade names or names of manufacturers in this report does not constitute 
an official endorsement of such products or manufacturers, either expressed or 
implied, by the National Aeronautics and Space Administration. 


Robert L. Stermer 
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AIR FORCE RESEARCH IN OPTICAL PROCESSING 


John Neff 
AFOSR/NE 
Bolling AFB 
Washington, DC 20332 


SUMMARY 

The Air Force is actively supporting scientific research in 
optical processing through the awarding of contracts and grants to 
university and industrial research laboratories. The emphasis is on 
optical and optical-electronic hybrid processing especially in the 
application area of image processing. A special interest exists in 
real-time pattern recognition processors for such airborne missions as 
target recognition, tracking, and terminal guidance. This paper will 
describe the areas of interest and the ongoing efforts in the Air Force 
research program. 


INTRODUCTION 

The history of computer technology indicates quite clearly that 
with the development of every new computer of larger capacity and faster 
speed the number of problem solving areas which would be benefitted 
increases and multiplies. During the past twenty years, the progress in 
solid state electronic computers has been extremely rapid, to the point 
that computers influence almost every aspect of our lives. Even though 
continuing progress in electronic computer technology can be anticipated, 
we would be remiss to overlook other techniques which show promise of 
complementing or even replacing the presently prevalent silicon LSI 
wafer. One promising alternative is provided by optics, including 
electro-optics and acousto-optics. And certainly not to be overlooked is 
the area of hybrid optical-analog/electronic-digital processing. Optical 
systems offer parallel computations with speeds unattainable with present 
digital computer technology, and the hybrid systems combine this speed 
and data handling capability with the computational flexibility and 
accuracy of digital processing. The development of efficient hybrid 
processors along with continued advancements in pattern recognition and 
computer-aided decision making will provide us with critical technology 
needed to maintain our world leadership in computing and signal 
processing capability. 

One of the most promising application areas for this emerging 
technology is image processing. Imaging sensors are becoming an 
increasingly important part of such military systems as reconnaisance, 
missile guidance, target identification and tracking, and space 
surveillance. Sensors of the future are expected to provide high 
resolution images at frame rates which will produce data in excess of 



10^ bits per second--a rate that digital technology will be hard 
pressed to handle even under the most optimistic projections. A major 
goal of the Air Force research in optical processing is to increase the 
flexibility of these processors to the extent that they can perform the 
functions of most of the image processing algorithms which have pushed 
digital technology to its limit. 

I will discuss the Air Force research interests in terms of the 
following five categories: pattern recognition, image encoding, 

incoherent image processing, integrated optics and acousto-optics, and 
hybrid opto-electronic processing. The first three categories will 
receive the majority of attention since they deal with image processing. 

PATTERN RECOGNITION 

The basic type of optical pattern recognition system is the 
frequency plane correlator or matched filter shown in figure 1. The 
reference or target information is stored in the processor as the complex 
conjugate of the Fourier transform. If the target information is present 
in the input, a peak of light will appear in the output plane, and the 
location of the peak in this plane will correspond to the relative 
location of the target in the input scene. Figure 2 illustrates the 
optical correlator in a more operational scenario in which a real-time 
input device is employed in the input plane and a large array of 
potential target filters are stored in the frequency plane. The input 
device, known as a real-time spatial light modulator, functions as an 
incoherent-to-coherent image transducer. One such modulator concept 
currently being investigated (ref 1) is shown in figure 3, and consists 
of a photocathode, a micro-channel array plate, and an electro-optic 
plate. The electro-optic plate carries a high resistivity dielectric 
mirror on one side and a transparent conducting electrode on the other. 
The input signal illuminates the photocathode which produces an electron 
image of the optical intensity. The image is then amplified by the 
microchannel plate and deposited on the dielectric mirror. This charge 
distribution establishes a corresponding spatially varying refractive 
index in the electro-optic crystal which is capable of modulating a 
readout laser beam. Successful operation of this device has been 
achieved, and current research efforts are aimed at improving its 
operation so as to meet the original performance goals of 20 line 
pairs/mi 11 imeter spatial resolution, 1 KHz framing rate, an optical 
quality of less than one tenth wavelength distortion of the wavefront 
over a 25 millimeter diameter, a sensitivity of one nanojoule per square 
centimeter, and a long-term optical information storage capability on the 
order of weeks. Using a new internal processing mode, based on secondary 
electron emission, the research is proceeding to show that the device is 
capable of performing level thresholding, contrast reversal, contrast 
enchancement, image addition, and image subtraction. 
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A high-accuracy terminal guidance sensor system will consist of an 
acquisition subsystem that scans the scene in the target area, a 
reference subsystem containing a photographic or equivalent image of the 
target and its surround, and a processing subsystem that correlates the 
real-time scene with the reference image and generates error signals to 
vector the warhead onto the target. The fundamental difficulty in 
achieving a practical terminal guidance system lies in correlation of the 
reference image with a real-time image that differs in scale, aspect, 
contrast, and even content when sensed in a different spectral band or at 
a different time than the reference image. In addition to reliability 
and accuracy of correlation, the processor must be rapid to permit a 
reasonable guidance error signal bandwidth for fast target approaches. 

The resulting requirement is for a processing subsystem with throughput 
rates in excess of 10? operations per second, and optical correlation 
with its vast parallelness at low power and cost is a most promising 
technology. 

Various optical processing approaches to missile guidance are 
being compared on the basis of input resolution, grey scale requirements, 
quantity of stored reference imagery needed, cycle times required, 
accuracy expected, processing needed, etc. The major emphasis has been 
on finding a processor or correlator architecture that is capable of 
maintaining correlation in the presence of various image degradations 
that invariably occur between the input and reference function during 
terminal guidance. Among the differences that have been considered are: 
scale, rotation, resolution, contrast, relief, data dropout, aspect, 
seasonal, years apart, and headings. Techniques are being studied for 
optimizing matched spatial filter synthesis parameters to improve the 
signal -to-noise ratio; however, these techniques often accentuate the 
high frequency detail of the image, thus rendering the correlation more 
susceptible to image differenes between input and reference. Therefore, 
space variant processing techniques are also being investigated to 
realize optimum correlation systems invariant to the dominant 
degradations. For example, the realization of a rotation-invariant 
correlation system can be obtained by performing a polar transform on the 
input and reference imagery. The magnitude of the correlation peak 
remains unchanged while the location of the peak translates proportional 
to the rotational difference between the two images. It has been shown 
that a multiple-invariant, space-variant optical processor is possible in 
which two functions described by any number of separate distortion 
parameters can be correlated with no loss in the signal-to-noise ratio of 
the correlation (ref 2). Other correlation procedures are also being 
investigated such as optical homomorphic filtering, optical statistical 
pattern recognition, an optical analog of the invariant moment method, 
and a hyperspace expansion and clustering technique. 

Besides research into improved light modulation and novel 
filtering schemes for pattern recognition systems, the Air Force has an 
interest in holographic optical element (HOE) research. HOEs offer 
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advantages of a reduction in the mechanical stability problems of 
conventional optical systems, a reduction in the number of optical 
elements, and a reduction in weight, size, and cost, all of which render 
these elements very applicable to optical systems for aircraft and 
missiles. The current state of the art in HOEs is limited to transfer 
function representation of linear, space-invariant optical systems. 
Unfortunately, such systems represent just a small fraction of the 
optical systems of interest. For example, only imaging systems with unit 
magnification are strictly space-invariant. For a space-variant system 
the impulse response is a function of the position of the impulse or 
point source of light in the input plane of the system. This means that 
in order to completely specify the impulse response, the response must be 
known for impulses located at all possible points in the input plane. 
Investigations are curently pursuing a sampling theorem-based technique 
for characterizing 2-D space-variant optical systems. The holograms 
representing the various impulse responses are multiplexed using a 
technique which encodes the phase of each reference beam with various 
diffusers. Another approach being studied involves computer multiplexing 
of the individual holograms. The research goals include not only a clear 
understanding of the basic theoretical and practical limitations of these 
approaches but also the discovery of additional multiplexing techniques 
(ref 3) . 

The final area of research interest under pattern recognition 
deals with the nonlinear processing of imagery. This opens up a whole 
new realm of possibilities to optical processing. For example, a basic 
tool in pattern recognition is the histogram which may be synthesized 
from a succession of intensity level slices such as shown in figure 4. 
Ideally, a level slice operation converts a continuous tone image into a 
pattern which represents only those points or areas of the input image 
which have a given intensity level. Figure 4 shows two non-ideal slices 
but ones which are sufficient to construct a histogram. The histogram 
would be constructed from the level slices for all possible input 
intensity intervals. Note, however, that the transfer function (I 0 ut 
versus I-j n ) is a nonlinear operation. Three different approaches to 
nonlinear optical processing are currently being pursued: pulse-width 

modulation employing halftone hard-clipping and coherent filtering, 
direct use of light modulator nonlinear characteristics, and conversion 
of input intensities to spatial frequencies by employing the variable 
grating mode of liquid crystals. In the first approach, an input signal 
or image with continuous levels is converted into a halftone image that 
is binary. This halftone image consists of a periodic array of black dots 
whose sizes vary spatially according to the spatial variation of 
intensity in the input. This is accomplished by optically adding a 
halftone screen and the input image and then hard-clipping the resulting 
image. Nonmonotonic nonlinearities are obtained by employing various 
Fourier plane filters with an appropriate halftone screen design. The 
specific nonlinearity will depend on the halftone dot shape and on which 
diffraction order is passed by the filter. 
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The second approach is directed toward achieving nonlinearities by 
relying directly on nonlinear characteristics of electro-optical devices 
such as real-time spatial light modulators. For example, certain of 
these devices depend on an electro-optically controlled birefringence to 
produce a selective linear differential phase retardation along two axes 
of a crystal. Upon placing the crystal between crossed polarizers, a 
sinusoidal variation of intensity transmittance with voltage is obtained 
which can be the basis for many optically controlled nonlinear 
functions. One such function which has been demonstrated is three-bit 
optical parallel A/D conversion with a bit rate potential estimated at 
1.2 x 10^ points processed per second (ref 4). 

The third approach, the variable grating mode technique, is based 
on the fact that under certain conditions a linear, phase grating 
structure can be established in a liquid crystal, and the period of this 
grating can be changed by varying an externally applied voltage as 
illustrated by figure 5. By employing a photoconductor in connection 
with the liquid crystal, it should be possible to have a grating 
frequency whose spatial variations correspond to the intensity variations 
of an image projected onto the photoconductor (ref 5). If the resulting 
encoded image is input to an optical spatial filtering system (figure 6), 
the various spatial frequency components would be diffracted to different 
locations in the frequency plane and could be filtered to produce an 
output with any desired nonlinear relationship to the input image 
intensities. 


IMAGE ENCODING 


With an increased interest in image type operations on behalf of 
the military comes an increased interest in image transmission. Remotely 
piloted vehicles and battlefield surveillance are two military operations 
where image communications will likely play a prominent role. Two 
definite problems encountered in any long distance transmission of analog 
image information are limited bandwidth and atmospheric scattering. With 
regard to the bandwidth problem, real-time image compression schemes are 
needed. The mosit severe limitation on existing schemes is the extensive 
computational burden, a problem for which optics may indeed be able to 
provide a solution. Research has successfully formulated a modified dif- 
ferential pulse code modulation (DPCM) technique which is suitable for in- 
coherent optical implementation. Referred to as interpolated DPCM, or IDPCM, 
the technique employs a mask to spatially sample the image. These samples 
are then quantized and transmitted. In addition, the sampled image is in- 
terpolated by a low-pass convolution and a different image is generated by 
subtracting this interpolated image from the original. This difference image 
is also sampled, quantized, and transmitted. At the receiver, the image 
samples are interpolated to form a low-frequency version, and the 
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difference-image samples are added to the interpolator output to 
reconstruct the original. A simulation of this technique has shown the 
validity of employing optical processing for bandwidth compression. 
Efforts are continuing toward expanding optical compression schemes into 
the realms of interframe and adaptive compression (ref 6). 

Another approach being pursued is to employ optical transformation 
of the image followed by a delta modulation of the transformation. Two 
analog transform encoding schemes are being considered: one is a 

coherent optical Fourier transformation while the other is an incoherent 
optical Hadamard transformation. Real-time linear and adaptive delta 
modulators operate on the video signal obtained via a TV camera in the 
output plane of the optical system (ref 7). With regard to the Hadamard 
system, the image is broken up into a large number of sub-images by a 
fly's-eye lens, and each sub-image is weighted by a Hadamard mask. 

The second problem area, that of atmospheric scattering, may be 
solvable by employing some novel encoding process such as 
intensity-to-frequency conversion. This involves using a 
wavelength-coded source to illuminate the object so that each resolution 
element (pixel) is illuminated with a different wavelength, thus 
realizing a spatial-to-spectral conversion of the object information. At 
the receiver, the collected light is decoded by a spectroscope which 
displays the various wavelengths at the proper spatial positions so as to 
reconstruct the original image. Since scattered light does not undergo 
any wavelength shift, the information on each of the spectral carriers is 
free to travel any path, either direct or multiply scattered, between the 
transmitter and the receiver. It may even be possible to use such a 
technique to transmit three-dimensional images (ref 8). 


INCOHERENT IMAGE PROCESSING 


Since the discovery of the laser in the early 1960's, coherent 
optical processing has advanced steadily toward developing an information 
processing capability with speed-of-1 ight performance. Two major 
difficulties of coherent processing which have prevented a more rapid 
advancement are the nonavailability of suitable spatial light modulators 
and a relatively low dynamic range. Incoherent processing, of course, 
does not need the incoherent-to-coherent conversion step which the 
spatial light modulators perform for coherent processors. Furthermore, 
incoherent processing avoids the problem of coherent noise or speckle 
which often masks the effects of the desired image processing operation. 
By lessening the coherence of the system, one is, in effect, increasing 
the number of information bearing channels, all of which are carrying the 
same information in parallel. The major result of this redundancy is an 
increase in the signal to noise ratio. On the other hand, incoherent 
processing is not without problems, such as a built-up of dc-bias. 
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Overcoming the problems of incoherent processing, or at least 
significantly reducing their impact, is the goal of Air Force research in 
this area. If the fundamental problems can be solved, the ultimate 
objective will be to realize all optical processing operations with 
incoherent systems. Not only do incoherent processors offer a 
significant reduction in cost over coherent systems, but the mechanical 
alignment problems are much less severe, and this may be a very important 
point if optical processors are ever to find their way into airborne 
information systems. 

Optical processors use interference to process information; 
however, with incoherent light, interference is much more difficult to 
achieve. Consider the formation of a Fourier transform hologram. If the 
required reference beam does not pass through the Fourier transforming 
lens, its optical path will not exactly match the object beam path and 
there will be no interference. But if the reference beam does pass 
through the lens, it will be Fourier transformed into a point image and 
will not be able to uniformly cover the object beam. This problem is 
being investigated using a technique involving a chirp-z transform. This 
method achieves a Fourier transform in a way basically different from the 
usual way, which is to place the object on the input side of a lens and 
take the output from the back focal plane. In the chirp-z method, a zone 
plate lens overlays the object, producing a compound object. The light 
then passes through a spatial filtering system consisting of two lenses 
and a spatial filter, and the resulting output is a Fourier transform. 

By making the Fourier transform in this manner, the previously noted 
problem can be avoided and it becomes possible to insert the entire 
optical system into a three-grating achromatic interferometer. Thus, 
after achromatizing the Fourier transformation process and introducing, 
by means of the interferometer, an achromatic coherent reference beam, it 
becomes possible to produce Fourier transform holograms in completely 
white light (ref 9) . 

Another effort is working more directly on conceiving an 
incoherent matched filter correlator (ref 10). The concept demonstrated 
to date employs a color-compensating grating which collects 
correlation-signal energy over a band of wavelengths, which has been 
dispersed by the matched filter, and concentrates it in a single spot to 
achieve a much improved S/N for the correlation peak. However, 
correlator operation still suffers from the very narrow spectral response 
of the matched filter itself. Various combinations of zone plates and 
gratings will be tried in an effort to overcome this deficiency and 
create achromatic systems capable of performing the operations of Fourier 
optics in white light. 

A third approach to incoherent optical processing, while not as 
universal or flexible as the previous two techniques, has the advantage 
of being relatively simple to implement. A diffraction grating is 
employed in the input plane of the optical processor to effect a 
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sinusoidal modulation of the input signal. This results in two off-axis 
spectral distributions in the Fourier plane of the processor, and these 
distributions will be dispersed into rainbow color due to the use of 
incoherent light. The origins of the differently colored signal spectra 
will be linearly dispersed in a direction perpendicular to the grating. 

A complex spatial filter for each of a discrete set of wavelengths will 
be placed in the spatial-frequency plane to produce a set of mutually 
incoherent filtered signals. The output of this incoherent filtering 
scheme will be formed by the incoherent addition of the discrete spectral 
bands (ref 11). 


INTEGRATED OPTICS AND ACOUSTO-OPTICS 


Spurred on by the tremendous impact that electronic integrated 
circuits have had, the field of integrated optics has already progressed 
to the point where complete circuits are being attempted. Such basic 
elements as optical modulators, coherent light sources, lenses with near 
diffraction-limited performance, and optical detectors have been 
demonstrated and may now be combined on a single substrate to perform 
some practical signal processing operations. In an effort to further 
expand these processing operations, the Air Force is interested in 
supporting research that will advance the current state-of-the-art, 
especially in achieving more compatibility between electronic and 
optical integrated systems. Much research is still needed in the areas 
of programmable filters, more simplistic lenses, broader band surface 
acoustic waves for acousto-optic processors, and higher processing gains. 

Investigations are being conducted into Fresnel diffraction in an 
attempt to fabricate efficient and inexpensive waveguide lenses for 
integrated optical systems. Current lenses are either geodesic or 
Luneburg; however, geodesic lenses involve an expensive and 
time-consuming precision grinding process while Luneberg lenses have very 
limited focusing capabilities in high index-of-refraction waveguides 
because of the small change of the effective index that can be obtained 
from known materials. Fresnel lenses have the potential to outperform 
both the geodesic and Luneburg lenses. They consist of pads deposited on 
the waveguide which act to achieve either phase shift or absorption in 
alternate half-period zones (ref 12). 

A recent accomplishment in the area of programmable filtering was 
the demonstration of a 32-bit, 17 Mbit/sec digital correlator as shown in 
figure 7. The reference signal is programmed into an array of 
interdigital electrode sets fabricated on a thin buffer layer on the 
surface of a planar electro-optic waveguide. The input signal is encoded 
as one of two possible frequencies of a surface acoustic wave (SAW). If a 
perfect match exists between the input and reference signals, the optical 
energy which is first diffracted by the SAW is diffracted a second time 
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by the electro-optic grating produced by the interdigital electrode sets 
and is focused onto the detector. If a match does not exist, such as the 
case in the figure, some of the energy is diffracted to another point in 
space and does not reach the detector (ref 13). 

There will always be some loss and degradation due to scattering 
in coupling light from a lithium niobate waveguide to a detector array 
formed on silicon whether or not channel waveguides are present. Also, 
accurate alignment of two structures will always be required. These 
aspects could be eliminated if layers of single crystal silicon could be 
grown on lithium niobate. Such growth by conventional epitaxy is not 
possible. However, based on recent experimental results involving 
laser-assisted growth of single crystal silicon layers on amorphous 
substrates and formation of MOS devices on these layers, it may be 
possible to form silicon photodetector arrays on lithium niobate (ref 14) 

Acousto-optics offers a potential for unique integrated optical 
signal processing devices which, in part, is due to the huge difference 
between acoustic and electromagnetic propagation velocities. Currently 
under study are several novel thin-film acousto-optic and magneto-optic 
devices with applications to wideband multi-channel information 
processing, high-speed deflection and switching of a guided light beam, 
optical time-division multiplexing/de-multiplexing, and tunable optical 
filtering (ref 15) . 


HYBRID OPTO-ELECTRONIC PROCESSING 


Many physical problems, especially in simulation and control, 
require the real-time solutions to 2D and 3D partial differential 
equations. Under investigation is a hybrid optical-electronic computer 
capable of high speed solutions of such equations. The speed and 
versatility of the hybrid computer will enable it to handle a wide 
variety of complex problems. It will be especially useful in 
applications requiring partial differential equations to be solved many 
times while initial conditions, coefficients, nonlinearities, and 
nonhomogeneous terms are varied. A confocal Fabry-Perot interferometer 
forms the basis of the optical portion of the computer which performs the 
Fourier transformations, the spatial filtering, and the feedback (ref 16) 

The digital processor operates in an analysis mode with regard to 
a digitized version of the optical output and acts as a controller for 
the mirror position and the laser beam scanner which produces the input 
image and the spatial filters in real time. The feedback configuration 
allows the optical system to achieve a filtering dynamic range of 
approximately twenty times that of a more conventional system without 
feedback. However, even better performance is now being sought through 
the addition of gain to the system. By offsetting system losses, even 



modest amounts of gain incorporated into the Fabry-Perot can lead to 
dramatic increases in filtering dynamic range. Furthermore, sufficient 
gain would allow the system to operate as an optical operational 
amplifier. The technique being investigated involves flash lamp-pumped 
dye lasing. The dye amplifier is placed within the Fabry-Perot and used 
to offset losses by coherently amplifying a He-Ne signal. In actuality, 
a two-stage injection-locked dye laser/dye amplifier arrangement is 
employed to aid in preserving signal frequency and relative phase which 
is so important in the feedback system. 

The Air Force is not interested at this time in supporting 
investigations into optical logic or digital computing, but rather those 
areas in which optics can best complement electronic processing. One 
such area would be where massive parallelism is desirable to perform 
computationally intensive operations as, for example, in matrix inversion 
or eigenvector decomposition. Requirements for matrix inversion, such as 
solutions of simultaneous equations and various signal filtering 
operations, have established a need for a fast parallel computational 
method. The problem of real-time eigenvector and eigenvalue 
determination from a given matrix is also important in practice. Such 
operations play a role in pattern recognition. Another area which 
strongly complements electronic processing is the use of optics in VLSI 
interchip and intrachip communications. Optics may offer a technique for 
overcoming a fundamental limitation to device density — that of being able 
to further reduce the spacing between interconnects without appreciably 
increasing interference. The practical problems of using optics can be 
separated into two categories: getting suitable optical sources and 
detectors onto a chip, and realizing suitable holographic optical 
elements. 
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Figure 1.- Optical matched filter for real-time scene 
matching with output shown as a computer-generated 
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ABSTRACT 


A computational model of the processes involved in multispectral remote sensing 
and data classification is being developed as a tool for designing smart sensors which 
can process, edit and classify the data that they acquire. An evaluation of sensor 
system performance and design tradeoffs can be expected to involve classification 
rates and errors as a function of number and location of spectral channels, radiometric 
sensitivity and calibration accuracy, target discrimination assignments, and accuracy 
and frequency of compensation for imaging conditions. This model seeks to provide a 
link between the radiometric and statistical properties of the signals to be classified 
and the performance characteristics of electro-optical sensors and data processing 
devices. Preliminary computational results are presented which illustrate the editing 
performance of several remote sensing approaches. 


INTRODUCTION 


To overcome present inefficiencies in worldwide monitoring of resources and the 
environment by remote multispectral sensing, it is necessary to develop multispectral 
sensor systems which are "smart" enough so that they can be relied upon to perform 
such tasks as identifying and locating features of interest, editing out areas of 
extensive cloud cover and haze, and compensating for atmospheric variability. The 
development of such smart-sensor systems must take into account the complex natural 
variability of surface and cloud reflectance and atmospheric radiative transfer. 

To do so, smart-sensor concepts should be developed and evaluated first as models 
in the computer, and only thereafter, if promising, as actual devices and systems. 

A comprehensive computational model of the deterministic and stochastic processes 
involved in remote sensing is currently being developed as such a tool for studying 
multispectral sensor systems and concepts.-*- This model accounts for remote multi- 
spectral data acquisition and processing as a function of both deterministic and 
stochastic elements of solar irradiance, atmospheric radiative transfer, surface and 
cloud reflectance, and sensor response. The model differs from other related ef- 
forts^ - ' 7 in two aspects: One, it treats all elements of the remote sensing process as 

parts of a single system. Two, it specifically relates stochastic properties of the 
sensor 'signal to stochastic properties of the atmospheric radiative transfer and scene 
spectral reflectance. 

is 

Located in Hampton, Virginia. 
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In this paper we use the computational model of the remote sensing process to 
study earth feature identification algorithms for onboard data editing. The objective 
of the feature identification task is to distinguish between vegetation, bare land, 
water, clouds and snow. If such discrimination between these categories could be made 
with reasonable accuracy and computational simplicity, then onboard data processing 
could be relied upon to reduce drastically the amount of data that needs to be trans- 
mitted and processed for routine remote sensing operations. 

The analysis in this paper is limited to two spectral channels which are centered 
at wavelengths 0.65 and 0.85 ym. These spectral channels are particularly well-suited 
for distinguishing vegetation from other earth surface features. For that reason they 
have been selected for the Feature Identification and Location Experiment (FILE) on 
Shuttle-OSTA 1. These two channels correspond closely to channels 3 and 4 of the 
Thematic Mapper (TM) and Mult ispectral Linear Array (MLA)^ so that the results pre- 
sented in this paper are relevant to data processing studies concerned with future 
remote sensor systems for monitoring earth resources and the environment. Results 
presented in this paper compare the relative accuracy and computational complexity of 
three decision techniques for performing the feature identification task. The driving 
variables are atmospheric conditions, solar incidence angle, and spectral reflectance 
properties. No attempt is made here to distinguish between clouds and snow since 
clouds are most efficiently identified, and discriminated from snow and ice, in the 
reflected IR around 1.5 ym and the emitted IR around 10 to 12 ym (atmospheric window) .9“10 


REMOTE SENSING MODEL 


In this section we briefly review the computational remote sensing model developed 
by Buck et al. ^ The model accounts, as depicted in Fig. 1, for data acquisition and 
classification. Data acquisition must account for the solar irradiance, atmospheric 
radiative transfer, surface reflectance, and sensor response. In mathematical terms, 
data classification is that process that maps the very large sensor sample space into 
a much smaller space of predefined categories or features. It is essentially the 
feature identification algorithm that defines these categories. 


Radiative Transfer 

Deterministic Processes - When atmospheric attenuation (absorption and scattering) 
and Lambertian ground reflectance effects are taken into account, the radiance derived 
from solar spectral irradiance incident on a downward-looking sensor is represented as: 

L= — E T y pT+L pT + L (1) 

TToO O d p 

where E 0 EE 0 (A) is the solar spectral irradiance at the top of the atmosphere; 
T 0 ET 0 (A,T,y 0 ) is the atmospheric transmittance along the incident path from the sun to 
the surface (solar zenith angle = 0 O , y o =cos0 o ) ; L d =L d (E 0 ,A,T,y 0 ,p) is the diffuse sky 
spectral radiance which results from all radiation scattered downward onto the surface 
(i.e., integrated at the target over elevation and azimuth); pEp(A) is the spectral 
reflectance of the surface (sometimes called "signature"); TET(A,T,y) is the atmosphere 
transmittance along the exitant path from the surface to the sensor (zenith angle = 0, 
y=cos0) ; and LpELp (E c , A ,t ,y o ,y , (J>) is the path spectral radiance which results from all 
radiation scattered upward along the path from the surface to the sensor. The other 
parameters are wavelength, A, optical thickness of the atmosphere tEt(A), and azimuth 
angle ((> between the planes of incidence and exitance. The component of the total 
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radiance L which arises from radiation reflected from the target is referred to as the 
beam spectral radiance Lp , that is * L^L-Lp and (E 0 ,X ,T ,p Q ,p ,p) . 


The optical thickness t=t(X), which governs atmospheric transmittance, sky and 
path radiance, is given by 


N 

t = .E- a. x. 
i=l i 1 


( 2 ) 


where a^^a^(X) is the attenuation coefficient of the ith atmospheric constituent and 
x^ the associated attenuator amount (often denoted up in the radiative transfer liter- 
ature) . The atmospheric transmittance over the incident path is given by e*“ T ^o, and 
that over the exitant path by e” T /b. The rigorous treatment of multiple scattering 
which leads to sky and path radiance is very difficult and computationally expensive. 

To use documented atmospheric radiative transfer models and to keep computations with 
the deterministic/stochastic model economical, we use the AFGL LOWTRAN 4^ model to 
account for attenuation in atmospheric absorption bands and the ERIM model (developed 
and described by Turner^) to account for scattering. For the spectral region ex- 
tending from 0.4 to 1.0 pm, the atmospheric radiative transfer is primarily affected 
by Rayleigh scattering (by air molecules N2 and O 2 ) , scattering by aerosols (haze drop- 
lets and dust), and absorption by ozone (O 3 ) , water vapor (H 2 O) , and molecular oxygen 
( 0 2 )- 


The ERIM radiative transfer model is used to account for single and multiple 
scattering in the atmosphere for a surface with a simple geometric pattern. We assume 
a horizontally homogeneous atmosphere which is bounded by a surface that consists of 
an infinitesimally small target and a large surrounding background, both of which have 
uniform diffuse (Lambertian) reflectances. The model accounts for realistic aniso- 
tropic phase functions and vertical changes in the atmosphere and for attenuation by 
ozone absorption and aerosol and molecular scattering but not for attenuation by water 
vapor and molecular oxygen. This limitation constrains the application of the ERIM 
model to the evaluation of spectral responses that do not significantly overlap the 
major H 2 O (0.86 to 0.99 pm) and O 2 (0.75 to 0.77 pm) absorption bands. More comprehen- 
sive computational models are available but their use as subroutines would be too 
expensive. 

The difference between the LOWTRAN 4 and ERIM models can be expressed in terms of 
Equation 1. Both models calculate the first term in the same way. LOWTRAN 4 does not, 
however, include either the second or the third terms which represent scattered solar 
radiation. 

Stochastic Processes - We regard the spectral radiance LEL(X) that reaches the 
sensor as a stochastic process whose value at each wavelength depends upon a number of 
random variables associated with both the atmosphere and surface. Letting the opera- 
tion E{*} denote the expectation (average) taken over the ensemble of all possible 
radiances associated with a particular surface, the mean <L(X)> and autocovariance 
C L (X,X’) of the radiance can be expressed as 

<L (X) > = E{L(X)} 


and 


C L (X,X’) = E{[L(X) - <L(X)>][L(X’) - <L (X 1 ) >] } 

To simulate the effects of atmospheric variability, we assume that the attenuator 
amounts x± in Equation 2 are random variables with a known mean Xj_ and covariance G^p. 


19 



It is particularly convenient, although not necessary for the purpose of simulation, to 
assume that the vector of attenuator amounts is multivariate Gaussian. Therefore the 
optical thickness will have a log-normal distribution. 

To simulate the effects of surface reflectance variability, we model the reflect- 
ance of a particular target surface by 

p(X) = p o (A) e ~ X ° 3 ° (A) 

where p c (A) and $ 0 (A) are deterministic functions which are characteristic of the sur- 
face, and x Q is a standard normal random variable with mean = 0 and variance ~ 1. For 
each surface the parameters p 0 (A) and 3 0 (A) are estimated from empirical reflectance 
data. 


Signal Vector and Reference Pattern 


The sensor converts the radiance L into the signal vector s with components Sj 
which are represented by the simple model 

s. - /°° L(A) S. (A) dA 
1 ° 1 

where Sj (A) is the deterministic spectral response of the j th channel. Effects due to 
electronic noise and errors in radiometric calibration are not treated here, in order 
to simplify the following formulations. 


Because the radiance is stochastic, the signal vector s is a multivariate random 
variable whose mean r and covariance C have components denoted by 


and 


r . = E{s . } 
J 1 


c.. = E{(s, - r.)(s.t - r,)} 

33 333 3 

where both j and j ! take on the values 1, 2, J, and J is the total number of chan- 

nels. It follows from the linearity of the signal conversion process that the refer- 
ence patterns can be computed as 


and 


r . 
J 


= f°° <L(A)> S . (A) dA 
o 3 


c , , = f°° f°° C 7 (A, A’) S. (A) S., (A 1 ) dA dA ’ 

JJ o o L j 3 

RADIOMETRIC PROPERTIES AND SENSOR RESPONSES 


In this section we present typical radiometric properties encountered in remote 
sensing and the spectral responses of the FILE sensor system. Our characterizations 
here are confined to a spectral region of 0.4 to 1.0 pm. 


20 



Model Inputs 


Solar irradiance - The solar spectral irradiance E 0 (A) incident upon the top of 
the atmosphere is relatively well known and is shown in Figure 2. Its variability is 
small' compared to other uncertainties and thus is ignored here. 

Surface reflectance - Table 1 summarizes the categories and substances used as 
examples in this investigation as well as the (assumed) standard deviations of their 
reflectances. The spectral reflectance curves are shown in Figure 3. Typical vari- 
ability of spectral reflectances realized in the simulation is shown in Figure 4. 

Atmospheric properties - The mean whole-atmosphere (i.e., vertical path) attenu- 
ator amounts for each of these atmospheric constituents, along with reasonable values 
of their standard deviations based on estimates of climatological variability, are 
listed in Table 2. All attenuator amounts are assumed to be uncorrelated, except for 
water vapor and aerosol which are assumed to have a perfect positive correlation. 

Figure 5 shows a plot of typical average radiance components. Figure 6 shows a typical 
realization of simulated radiance variability using the LOWTRAN 4 model. 

Sensor response - The FILE system consists of two sensors, one centered at .65 \lm 
and one centered at .85 ym. These are relatively narrow-band sensors spectrally (20 
nm bandwidth) , compared to the 100 to 300 nm bandwidth used in the LANDSAT sensor 
system. The sensor bandwidths do not overlap any major atmospheric absorption bands. 
The FILE system includes an onboard categorizer that assigns each pixel to bare land, 
vegetation, water, or snow/clouds. The FILE sensor channels were originally chosen 
because of the desirable properties of the ratio of the two channel outputs. Also, 
channel ratios have some useful compensation effects for sun angle and atmospheric 
variability. 


FEATURE IDENTIFICATION TECHNIQUES 


In this section we describe three methods that could be used to identify (edit) 
such categories as vegetation, bare land, water, and cloud/snow. Two of these methods 
are closely related; they use either the maximum likelihood (MLH) or mean-square dis- 
tance (MSD) classification algorithms to classify the signal vectors according to 
reference patterns (or training sets) and then collect all classes into categories. 

The third method, referred to as the boundary approximation method (BAM), avoids the 
classification step and assigns the signal vectors directly into categories. 


Classification 

A very common classification procedure which makes use of covariance information 
is based upon maximizing the (assumed) Gaussian PDF. Specifically, a particular signal 
vector, s, representing J spectral channels, is classified as spectral type n, provided 
the Gaussian PDF of s conditioned on spectral type n is largest, i.e., provided 

PDF(s|n) = Y/oT rr 77 exp ^ (s_rn) (s_r nJ (3) 

(27T) J/Z |c n | 1// 

is greater than PDF (s|n T ) for all other spectral types n ! = 1, 2, ... N. To avoid 
the computational expense of evaluating the exponential, an equivalent classification 
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procedure can be used which minimizes -log e [PDF (s|n f )]. This, in turn, is equivalent 
to determining the n which minimizes 

(s-r ) t C ^ (s-r ) + log 
n n n e 

where | * | denotes determinant. If the a priori probability of each spectral type is 
known (and all types are not equally likely) , it is often desirable to weight the 
classification procedure by this prior information. However, no weighting is used for 
the results presented in this paper. For this classifier, the mean and covariance 
matrix uniquely determine a class. This is the maximum likelihood classifier (MLH) . 



Several simplifying assumptions concerning the nature of the covariance matrix, 
given imperfect knowledge, can be made. In the special case of 

C = I 
n 

where I is the identity matrix, minimizing Equation 4 becomes equivalant to minimizing 

(s-r ) t (s-r ) 
n n 

This is commonly called the mean-square distance (MSD) or Euclidean-distance classifier. 
From a computational standpoint, we eliminate the evaluation of one vector-matrix 
(lxn * nxn) multiplication and the addition of the logarithm term. For this special 
case the mean uniquely determines the class. 


Editing 

Aggregation method - The MLH and MSD classification algorithms work with classes 
but can be used to distinguish between categories such as vegetation, bare land, water, 
and clouds/snow by collecting a set of classes to define each of the four categories. 
For example, if one can pick some small number of vegetation classes that approxi- 
mate (span) the vegetation category then one has a method of categorizing vegetation. 

In other words one can blanket the area, in spectral space, occupied by the vegetation 
category with a set of classes. Computationally , what can be done is to actually 
assign a signal to a class (according to the chosen decision rule) but to only retain 
the category within which that class falls. For example, a signal from a corn target 
might be classified as wheat but the editor retains only the fact that it has been 
categorized as vegetation. The overall accuracy of categorizing will be higher than 
the overall classification accuracy because intra-category class confusion is not 
counted as wrong. We call this method categorization by aggregation . 

Boundary approximation method - In discussing this editing technique frequent ref- 
erence will be made to covariance ellipse plots. These ellipses are defined by the 
following equation: 

(x-r ) t C 1 (x-r ) = 1 
n n n 

where x is the channel vector. If the nth class was truly multivariate-Gaussian then 
this ellipse would enclose 66% of the data. The ellipses are centered about the class 
mean. These plots provide a very useful approximation to the scatter of the data due 
to such things as the variability in radiance shown in Figure 6. Figure 7 shows el- 
lipse plots for simulated data for a total of twenty classes. This data represents a 
simulation for 23 km visual range and a solar zenith angle of 30° , using the FILE channels . 
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As stated previously this particular set of FILE spectral bands was chosen because 
of the properties that the channel ratio possessed. Figure 7 shows 3 lines superimposed. 
The lines are chosen so as to delineate the four categories. These lines represent 
simple approximations to the boundaries of the categories in spectral space. Hence the 
name Boundary Approximation Method (BAM) . The lines drawn are not strictly defined by 
the channel ratios. It was decided that, by adding an offset to the two diagonal equ- 
ations, much better performance could be achieved. This causes only a slight increase 
in computational costs. The third line represents an "absolute" radiance threshold 
which forms the boundary between the bare land and snow/cloud, categories which have 
similar channel ratios. However, this threshold has a sensitivity to solar zenith 
angle. The three boundaries are defined by the following equations. If Xj_ is the 
value of channel i then the three lines are defined as: 

X 2 = 1.45 x-, - 4.1 
x 2 — 3.0 x^ — 2.1 
X 2 = 16.5 

As one can see this method is very simple in terms of the calculations necessary to 
categorize a signal and is similiar to the standard parallelepiped classifiers. This 
method represents the opposite end of the spectrum, in terms of computational complex- 
ity, from the aggregation method. 

Space qualified data processing hardware is still rather expensive and bulky. Any 
implementation of onboard processing of multispectral data will encounter very conser- 
vative limits in terms of available processing and storage capacity. This, in turn, 
limits the computational "costs" that can be incurred in identifying features. Esti- 
mates of the computational complexity of the two methods of aggregation were made after 
the formulas were simplified and modified as much as possible. No provision was made 
for special processing architecture (i.e., parallelism) in the estimates. For the 
boundary approximation technique we have two equations to evaluate with 1 add and 1 
multiply each for a total of 4 add/multiplies. The number of add/multiplies per pixel 
necessary to do the MSB classification, and thus editing, can be approximated by 2LJ 
where L is the number of classes and J is the number of channels. Similarly, the 
number for PDF can be approximated by (L/2) (J^+3J) . Those interested in more details 
might start with Ref. 13 and 14. 


COMPUTATIONAL EXPERIMENTS AND RESULTS 
Experiments 

In this section we present results of computational experiments which illustrate 
the performance of the MLH and MSD aggregation and the boundary approximation methods 
in distinguishing between such spectral features as vegetation, bare land, water, and 
cloud/snow from data obtained with the two FILE spectral channels. These two chan- 
nels are centered at 0.65 and 0.85 ]im and thus approximately coincide with TM/MLA 
sensor channels 3 and 4, The results provide insight into the susceptibility of 
feature identification accuracy on the natural variability of atmospheric radiative 
transfer and target and background spectral reflectance. 

The computer program (previously described) simulating the performance of an 
orbiting multispectral sensor generates pseudo-random observations (i.e., signal 
vectors) which are assembled into training sets for computing a reference pattern 
library. Subsequently, observations are generated for input to the editor/classifier 
which utilizes the reference pattern library or other criteria to make its decision. 
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Editing accuracy (i.e., the ratio of correct categorizations to the total number of 
observations) is used as a figure of merit to determine sensor performance. Training 
sets and editing accuracies are based upon a total of 100 observations per target. 

In practice, classification using the elements of a reference library will always 
be compromised since these elements must be based upon values for the means and covar- 
iances obtained at one specific set of conditions. For example, the reference library 
may have been compiled when the atmosphere was relatively clear, while the observations 
were made when the atmosphere was hazier. This error source has been included by com- 
puting the reference vector r and covariance matrix C for each target at a visual range 
of about 23 km (representing a moderately clear atmosphere) , and the signal vectors s 
either at the same visual range or at reduced visual ranges of about 10 km and 5 km. 
Similarily, to assess the effect of changes in solar incidence angle, reference data 
were obtained for a solar incidence angle of 30°, whereas feature identification deci- 
sions were made either at the same or at 40° solar incidence angle. The sensor viewing 
geometry remained vertical. 


Results 

Figure 8 summarizes the predicted feature identification accuracy, and Table 3 
presents the amount of computations required for the three decision processes. The 
feature identification accuracies are complex functions of changes in task assignment, 
visual range, and solar incidence angle. Nevertheless, it can be concluded that the 
MLH and MSD aggregation methods provide consistently higher feature identification 
accuracies than the BAM and also tend to be less susceptible to changes in both visual 
range and solar incidence angle. However, this improvement is gained at a substantial 
increase in computational and storage requirements. These requirements would increase 
rapidly for the MLH and MSD aggregation methods with increases in the number of sub- 
stances to be accounted for and in the number of spectral channels to be used. One 
can also see that for a 3-fold increase in computational cost little or nothing is 
gained in terms of increased accuracy or sensitivity to sun angle by using the MLH as 
opposed to the MSD. The results in figure 8 serve to show that small changes in the 
mix of classes, observed by the sensor, can have as important an effect as the other 
sources of error, such as atmospheric degradation. Specifically, ripening barely 
lacks the strong chlorophyll absorption band that the .65 pm channel is designed to 
detect in vegetation. Therefore, barley is likely to be confused with bare land. 

These results suggest that BAM could become a useful algorithm for spectral feature 
identification if it is expanded to use more than 2 spectral channels and if changes in 
solar incidence angle could be accounted for. It would, of course, also be desirable 
to account for changes in visual range; however, this would be more difficult. 

In practice the exact formulation of an algorithm depends on the particular task. 
For example, in editing out clouds, errors of omission, whereby we retain some cloud 
data, are less troublesome than errors of commission, whereby we edit out data from 
other categories. In general, if one particular category is of interest the boundaries 
for that category can be "relaxed" somewhat in order to pick up marginal outliers. In 
more complete analysis the a priori probabilities of each of the categories are needed 
in order to optimize the omission-commission trade-offs. This is also true for the 
aggregation method. 


CONCLUDING REMARKS 

Two fundamentally different approaches to feature identification, aggregation and 
boundary approximation for onboard data editing have been examined. Relatively small 
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differences in accuracies between the simplest boundary approximation and the com- 
paratively complex aggregation method were found, whereas the difference in compu- 
tational requirements is very large. Further investigation should be performed to 
determine improvements in feature identification accuracy that may be achieved by 
careful selection of both the number and location of spectral channels. 

The results obtained using the computational model were generally consistent with 
practical experience with Landsat data in terms of sensitivity to changes in haze, sun 
angle, and to the set of targets defined. The use of this model as a tool in the pre- 
liminary design and evaluation phase of remote sensing systems should prove valuable. 
To improve computational accuracy it will be necessary to account for the probability 
of occurence of various targets and the probability of encountering specific atmo- 
spheric conditions. 
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Table 1. Targets and the Assumed Standard Deviation of Their Reflectance 





Category 

Substance 

Standard deviation 
of reflectance, 0 

P 


I. Vegetation 

Barley 

.1 


Wheat 

.1 


Oats 

.i 


Corn 

.i 


Aspen 

.1 


Red pine 

.1 


White pine 

.1 

II. Bare land 

Chernozem- type soil, Nebraska 

.125 


Pedialier-type silt, Arkansas 

.105 


Pedocal-type soil, Ohio 

.105 


Pedocal-type soil, Nebraska 

.02 


Quartz sand, Oregon 

.140 


Clay, Missouri 

.055 


Red quartz and calcite sand, Utah 

.105 


Pedocal-type soil, Oklahoma 

.09 


Concrete road 

.1 


Asphalt road 

.1 

III. Water 

Sea water 

.06 

IV. Cloud/Snow 

Optically thick cloud 

.1 


Sugar consistency snow 

00 

o 
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Table 2. Attenuator Amounts in Vertical Column of Atmosphere 


Average value, x 


Standard deviation, 0 


Attenuator 


Air (N 2 , 0 2 ) 

Aerosol: 23 km visual range 
10 km visual range 
5 km visual range 

Ozone (O3) 

Water vapor (H 2 0) 

Molecular oxygen (0 2 ) 


LOOT RAN 

atm 

» cm 

// 

m 

-2 

LOOTRAN 

atm 4 

4 cm 

// 

m 

-2 

8.0 km 

8.0 x 

10 5 

2.2 

X 

29 

10 * 

0.24 km 

2.4 

X 

10 4 

6.5 

X 

10 27 

1.5 km 

1.5 x 

10 5 

4.0 

X 

00 

CM 

O 

1 — 1 

0 . 5 km 

0.5 

X 

10 4 

1.4 

X 

1q28 

3.0 km 

3.0 x 

10 5 

8.1 

X 

CO 

CM 

O 

1 — 1 

1.0 km 

1.0 

X 

10 5 

2.7 

X 

10 28 

5.0 km 

5.0 x 

10 5 

1.4 

X 

29 

10 

1.7 km 

1.7 

X 

10 5 

4.6 

X 

10 28 

0.34 atm* cm 

3.4 x 

io' 1 

9.2 

X 

22 

10 

0.10 atm* cm 

1.0 

X 

10" 1 

2.7 

X 

10 22 

1.14 g/cm^ 

1.14 

3.1 

X 

10 23 

0.36 g/cm^ 

3.6 

X 

lo*" 1 

9.7 

X 

10 22 

1.7 km 

1.7 x 

10 5 

4.7 

X 

10 28 

0.12 km 

1.2 

X 

10 4 

3.2 

X 

10 27 


Table 3. Computational Requirements for 2 Spectral Channels 


Decision Add/Multiplies with 

Process number of targets: 

17 20 


MLH 

187 

220 

MSB 

68 

80 

BAM 

4 

4 
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Figure 3.- Spectral reflectances. (See Table 1 for standard deviation 

of reflectances.) 





Figure 4.- Typical realization of simulated variability of 
three spectral reflectances. 
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TRAL RADIANCE, 


35 



WAVELENGTH, fjxn WAVELENGTH, yum WAVELENGTH, /tm 


Figure 5.- Average beam, path, and total radiance for com as target 
and average soil as background using ERIM radiative transfer model. 
Exitance is vertical and solar incident angle is 30°. 



3 .4 .5 .6 .7 .8 .9 1.0 1.1 .3 .4 .5 .6 .7 .8 .9 1,0 1.1 .3 .4 .5 .6 .7 .8 .9 1.0 1.1 

WAVELENGTH, /4m WAVELENGTH, /4m WAVELENGTH, /4m 

Figure 6.- Typical realization of simulated variability of spectral radiances 
incident on the multispectral sensor for three visual ranges using LOWTRAN 4. 
Target is quartz sand and solar incident angle is 30°. 
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Figure 7.- Covariance plot of signal from two spectral channels for 23 km visual range and 
30° solar incidence angle, and regions used for the boundary approximation method (BAM) 
decision process. 
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TARGETS: 
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20 



VISUAL RANGE 


(a ) Number of targets is 17 or 20, and data is acquired at a solar incidence angle 
(SIA)of 30°. 
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ACCURACY 
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Figure 8.- Feature identification accuracies attained with three 
decision processes: maximum likelihood (MLH) and mean-square 

distance (MSD) aggregation method and boundary approximation 
method (BAM) . 
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ROLE OF OPTICAL COMPUTERS IN AERONAUTICAL 


CONTROL APPLICATIONS 


Robert J. Baumbick 
NASA Lewis Research Center 
Cleveland, OH 44135 


ABSTRACT 

More complex aircraft and mission requirements will require larger, faster 
digital computers to perform a variety of functions. For adaptive control, 
controller gains are computed as a function of flight conditions. For multi— 
state problems, high speed computing is required to perform matrix/vector and 
matrix/matrix operations in order to compute these controller gains. 

Continued safe operation of the plant has to be maintained in the event of 
sensor failure. Accommodation of sensor failures requires a system math 
model that operates in real-time. The model, used to calculate an estimate 
of the failed sensed variable, should ideally be a full nonlinear model. 
Currently simplified models are used because of computing limitations. 

Because of the above requirements, research is being done to determine the 
role that optical computers might play in aircraft control. The optical 
computer has the potential high speed capability required, especially for 
matrix/matrix operations. The optical computer also has the potential for 
handling nonlinear simulations in real-time. 

In addition to the potential high speed capability of optical computers, they 
may also be more compatible with fiber optic signal transmission. Fiber optics 
offer advantages over conventional wire systems. One attractive feature of 
fiber-optic systems is immunity to noise generated from other electromagnetic 
sources and immunity to lightning strikes. Optics also permit the use of 
passive sensors to measure process variables. No electrical energy need be 
supplied to the sensor. Complex interfacing between optical sensors and the 
optical computer may be avoided if the optical sensor outputs can be directly 
processed by the optical computer. 
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Summary 


More complex aircraft and mission requirements will require larger, faster 
digital computers to perform a variety of functions. For adaptive control, 
controller gains are computed as a function of flight conditions. For multi 
state problems high speed computing is required to perform matrix/ vector and 
matrix/matrix operations in order to compute these controller gains. 

Continued safe operation of the plant has to be maintained in the event of 
sensor failure. Accommodation of sensor failures requires a system math model 
that operates in real-time. The model, used to calculate an estimate of the 
failed sensed variable, should ideally be a full nonlinear model. Currently, 
simplified models are used because of computing limitations. Because of the 
above requirements, research is being done to determine the role that optical 
computers might play in aircraft control. The optical computer has the 
potential high speed capability required, especially for matrix/matrix 
operations. The optical computer also has the potential for handling nonlinear 
simulations in real-time. 

In addition to the potential high speed capability of optical computers, they 
may also be more compatible with fiber optic signal transmission. Fiber optics 
offers advantages over conventional wire systems. One attractive feature of 
fiber-optic systems is immunity to noise generated from other electromagnetic 
sources and immunity to lightning strikes. Optics also permit the use of passive 
sensors to measure process variables. No electrical energy need be supplied 
to the sensor. Complex interfacing between optical sensors and the optical 
computer may be avoided if the optical sensor outputs can be directly processed 
by the optical computer. 

NASA Lewis Research Center is supporting two grants for basic research on 
optical computing and how it may be used in aeronautical applications. 
Carnegie-Mellon University is developing a hybrid electro-optic computer capable 
of performing matrix operations such as inversion, multiplication, etc. This work 
considers an analog optical computer. 

Ohio State University is developing a digital optical processor. This work 
centers around a liquid crystal light valve which is used to perform various 
logic operations and arithmetic operations. Both binary and residue arithmetic 
are being considered for these applications. Residue arithmetic is potentially 
more attractive because of the inherent parallelism of this type of arithmetic. 

The hybrid and digital optical processors both offer potential improvements in 
computing speed compared to all-electronic processors, but both require 
innovations in device technology before they can be used in practical 
applications . 
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Introduction 


This paper discusses research programs sponsored by NASA Lewis Research Center 
for work on optical computers. The objective of this research is to determine 
the role optical computers may play in future aircraft engine systems. Fiber 
optics are attractive for engine applications because of optics’ inherent 
immunity to electromagnetic interference (EMI) and because optical signals can 
be safely transmitted through areas that contain explosive materials. A program 
was undertaken to develop optical sensors for engine control. Optical links were 
considered to be superior for signal transmissions aboard the aircraft, especially 
for off-engine mounted control computers. Currently, engine control computers are 
mounted on the engine. Special environmental packages must be designed to protect 
the computer from the harsh temperature and vibration environment. Off-engine 
mounted computers would be located in a more benign environment. Along with the 
optical sensors and optical transmission lines, optical computers are considered 
in this future technology program to determine if and how optical computers could 
be used. Optical computers are considered because of potential speed improvements 
over conventional processors. Future aircraft computers will be faster and will 
have a higher capacity to handle the more complex engines of tomorrow. Some of 
the functions onboard computers might be called on to perform, if the computers 
could operate in real-time and had the required capacity, are adaptive control, 
engine condition monitoring, and sensor failure accommodation. For adaptive 
control the computer must perform vector/matrix and matrix/matrix calculations. 

These operations are required to calculate optimal controller gains for multi- 
variable control. 

Engine condition monitoring requires an accurate nonlinear simulation of the 
engine, and this simulation must operate in real-time. Sensor failure accommodation 
also requires a nonlinear model to provide accurate state estimates of failed 
sensor states. Real-time simulation of nonlinear systems is, therefore, important 
for safe, efficient operation. 

The optical computer has the potential for high speed operation required for 
adaptive control and has the potential for handling nonlinear simulations in real- 
time. Complex interfacing between optical sensors and the optical computer may be 
avoided if the optical sensor outputs can be directly processed by the optical 
computer. NASA Lewis Research Center is supporting two grants for basic research 
into optical computing and how optical computing could be used in aeronautical 
applications. Carnegie-Mellon University is developing a hybrid electro-optic 
computer capable of performing matrix operations such as inversion, multiplication, 
etc. Ohio State University is developing a digital optical processor. A liquid 
crystal light valve (LCLV) is used as the central computing element. With the 
LCLV, logic operations and arithmetic operations can be done. 
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Hybrid Electro-Optical Processor 


Figure 1 illustrates possible steps used in calculating the optimal controller gains 
for adaptive control. From sensed information about the system to be controlled, a 
linearized model is developed using various identification techniques. Given a 
performance index, optimum control gains are calculated for the multivariable system 
by solving the Matrix EJLccati Equation. At present this procedure is carried out on 
the ground. It would be desirable to do this onboard since the optimum gains could 
be updated to reflect changes in engine characteristics or variability from engine 
to engine. To do this onboard would require fast solutions to the Matrix Riccati 
Equation. Typically, a solution of the Matrix Riccati Equation takes hundreds of 
msec on an electronic computer. Using an optical computer to perform the matrix 
operations, solution times of a msec or better are possible. 

The Carnegie-Mellon optical vector/matrix multiplier is shown in figure 2. The 
values of the components of the vector x are represented by the intensities of 
the light emitting diodes (LED) or Laser Diodes (LD) . The LED outputs are 
connected to the matrix mask "A" with optical fibers. Variable transmittance of 
the cells of the "A" matrix represent the values of the elements a... Because the 
A matrix can be bipolar, scaling and biasing to accommodate bipolar^ values must be 
done. These scaled, biased values are the values of the a. t elements. The product 
of the intensity and mask cell transmittance are summed anS^ focused on a linear 
detector array. This operation is performed rapidly on the optical computer 
because all operations are done in parallel. When combined with a microprocessor, 
the optical matrix/vector multiplier can be used to iteratively solve equations 
of the form Ax - y where x is unknown (figure 3) . A guess is made for the x vector. 
A new x value is calculated, and when the last value of x and the new value of x 
are equal wjthin a prescribed tolerance, the solution converges and the implicit 
solution A y = x is obtained. The steady-state Matrix Riccati equation can be 
reduced to the form Ax = y and solved by this method. 

A photograph of the vector/matrix multiplier hardware used in lab demonstrations 
is shown in figure 4. The LED array shown is sealed with white RTV compound and 
bolted to the fiberoptic element, matrix mask and detector array. To have any 
practical application the mask used to represent the "A" matrix must be 
alterable to accommodate changing values of the elements a. , . This concept 
does have the potential for rapid solutions of the Matrix Mccati Equation. 

Digital Optical Processor 

A second grant sponsored by NASA Lewis for work in optical computing is with Ohio 
State University. The objective of this work is to build and demonstrate optical 
analogs to digital components to perform various combinational and sequential logic 
functions. The main element used in this work is a liquid crystal light valve. 
Residue and binary arithmetic are both being considered for use by this optical 
computer . 

The liquid crystal light valve shown in figure 5 operates in a controlled 
birefringence mode. Birefringence means that the refractive index is different 
when the polarization is in the direction of the principal axes compared to 
polarization at right angles to the axis. In the off state the cigar-shaped 
liquid crystal molecules are parallel to the glass face. This state has finite 
birefringence. This birefringence is decreased with application of the AC voltage. 
The molecules tip as a function of the excitation and light input. 
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Figure 6 illustrates how the LCLV operates. The electrical excitation can be 
adjusted to either of two values. Zero optical input produces either no change 
in the read beam or a rotation of 90 in the polarization of the read beam. 

The intensity of the input light beam is set so the complementary rotation is 
achieved when the beam is present. 


The face^of the liquid crystal light valve can operate on approximately 100 000 
spots/in (figure 7). Each spot is independent of its neighbors. This enables 
the LCLV to be used for operating on 2-D arrays of binary data. The use of 
residue arithmetic with this LCLV will make optimum use of the parallelism of 
the LCLV. This LCLV, with the extremely high density of spots, has potential 
for very large data transfer rates. 

Ohio State has demonstrated a number of logic functions using the LCLV. Figure 
8 illustrates an optical AND operation. The LCLV is biased to produce 90 
rotation of polarization with bright input. As long as A and B spots both have 
bright inputs there is an output. Should either A or B go dark the output goes 
to zero. 

Figure 9 shows an optical latch circuit. The Gian Thompson prism reflects 
vertically polarized light and transmits horizontally polarized light. The LCLV 
is biased to produce a 90 rotation of the reflected beam when the input to the 
back of the LCLV is bright. The latch is configured to load the data input when 
the clock is bright (high) and to store the input when the clock is dark (low) . 

The stored output Q is the complement of the input; that is, when the input is 
bright, Q will be dark and vice versa. If both the clock and the input are bright, 
the input beam will be rotated 90 to vertical polarization and pass through the 
analyzer to supply a bright signal to the back of the LCLV. The horizontally 
polarized light from the DC light source will be rotated to vertical polarization 
upon reflection from the LCLV__and will be reflected by the Gian Thompson prism, 
thus producing a dark output Q. When the clock goes dark, the dark output Q will 
remain regardless of changes in the input. This occurs because the DC light source 
will continue to illuminate the back of the LCLV through the feedback loop. 

Similar reasoning can be used to explain the loading and storage of a dark 
input which produces a bright output Q. 

The LCLV used in these experiments is very slow, taking hundreds of millisecs 
to operate. For practical systems the spatial light modulator must operate 
in msecs or better to achieve the high data rates required for real-time 
computing. 
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Concluding Remarks 


In summary, future aeronautical control systems will be more complex. Many 
functions not performed now and many functions done on the ground would better 
be done onboard if the speed and capacity of the computer were adequate. Among 
these functions are providing a real-time engine simulation for engine condition 
monitoring and for sensor failure accommodation. For adaptive control, onboard 
optimum multivariable control design would be done. For this to be done, rapid 
solutions of the Matrix Riccati Equation are required. Optical computers show 
some potential for applications to engine control systems because of their 
extremely high speed. Large data transfer capabilities and very fast solution 
of matrix equations are possible. However, much work has to be done in the 
area of faster spatial light modulators, optical read/write memories, and 
addressable alterable masks. These technology developments together with 
integrated optics may result in a practical electro-optic computer that could 
be used in engine control systems. 
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OPTICAL DIGITAL COMPUTERS * 

Alan Huang 
Bell Laboratories 
Holmdel , NJ 07733 

Advances in integrated optics, optical communications, optical 
bistability, and optical disk storage have prompted a renewed interest in 
studying the possibility of an optical digital computer. The limitations in 
utilizing optics stem from the fact that it is very difficult to get one 
optical signal to affect another. This makes it difficult to perform 
switching. One approach around this problem is to detect one optical signal 
with a pn junction and stimulate another signal with another pn junction. 

The need for pn junctions immediately makes the use of optics questionable. 
The potential of optics ironically stems from the same inability for 
interaction. Lenses, prisms, and mirrors can easily communicate thousands of 
channels of information in parallel. The relevance of this to computers is 
that the throughput of future systems will be limited by clock skew, 
bandwidth, interconnection complexity, pin limitations, and the Von Neumann 
bottleneck. These are all communications rather than switching problems. 
Unfortunately, there is no easy way to incorporate the communications 
capabilities of optics in conventional processors since the architecture of 
these processors have been adapted by evolution to the implicit tradeoffs of 
an electronics technology. Such a change is so fundamental that a careful 
review of the basic structure and function of conventional processors is 
necessary. 


* Because this paper was not available at the time of publication, only the 
abstract is included. 
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A design for a optical digital computer is presented to illustrate how 
some of the unique properties of optics such as its innate parallelism and non- 
interfering propagation can be used to overcome some of the limitations of 
future computers. The basic approach involves decomposing the structure of a 
finite state machine, the distant ancestor of current computers, into a logic 
unit, an interconnection array, and a latching unit; and then implementing 
each of these units with optics. The logic unit for such a processor can be 
constructed from an optical NOR gate array. Such a logic array can be imple- 
mented by projecting several binary images on a common surface, inverting, and 
thresholding the intensity of the result. Each pixel functions as a NOR gate 
and such a NOR operation is sufficient to establish a complete logical set. 

The latching unit which serves as the memory for the finite state machine can 
be implemented with various bistable optical devices. The interconnection 
array can be implemented with a hologram or conventional optics which images 
each pixel (x, y) to pixel (x, y+1 ) , pixel (x+1, y+1 ) , and pixel (x-2,y+l ) . This 
interconnection pattern establishes a complete connective set which can be 
customized to provide arbitrary interconnections. 

This basic architecture, while abstract and quite primative, can be 
extended, specialized, and applied to a wide range of problems in image and 
parallel processing. 
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HIGH SPEED TECHNIQUES FOR 
SYNTHETIC APERTURE RADAR IMAGE FORMATION* 

Demetri Psaltis 

California Institute of Technology 
Pasadena, CA 91125 

Armand R. Tanguay, Jr. 

University of Southern California 
Los Angeles, CA 90007 

and 

Thomas J. Bicknell 
Jet Propulsion Laboratory 

California Institute of Technology 
Pasadena, CA 91109 

A synthetic aperture radar system produces data requiring rather complex 
processing to form the image of the terrain being mapped. The process is 
essentially a two-dimensional convolution which is usually separated into two 
one-dimensional convolutions with spatial variance of the reference function 
in one dimension. The reference functions may be quite large (1000 elements or 
more) requiring high computation rates for digital systems. Optical systems 
have a significant advantage due to the parallelism of one- or two-dimensional 
processing. The linear FM variation of the reference function in both 
dimensions makes the optical system rather simple. Since the point target 
phase histories are self- focusing, like Fresnel zone plates, only bandpass 
filtering is required for moderate resolution systems. The spatial 
variance of the reference function or phase history in the along- tract 
dimension is linear and can be compensated for by tilting either the input and 

* Because this paper was not available at the time of publication, only the 
abstract is included. 
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output image planes or a cylindrical lens. An additional complication of 
higher resolution systems is the requirement to compensate for range migration. 
This correction is usually performed by lenses used as a phase filter in the 
transform plane. A conventional radar correlator is shown in figure 1. The 
range telescope images both the range and the azimuth focal planes. The azimuth 
telescope affects only the azimuth or along-track dimension, bringing it into 
coincident focus with range. 


FREQUENCY 

PLANE 




RANGE TELESCOPE 



AZIMUTH TELESCOPE 


Figure 1.- Film input coherent synthetic aperture radar processor. 

The desire to achieve real time throughput, higher reliability, and greater 
accuracy has led to the development of high speed digital systems for SAR 
data processing. However, such systems are quite large, costly, and power 
consumptive, even using recent and expected future developments in digital 
technology. Many radar applications, such as quick-look or search modes, require 

or would be better served by onboard processing. These systems require small, 
lightweight processors, especially for use in conjunction with spacecraft- 
borne radars. A good example is the upcoming VOIR mission where onboard 
processing would allow transmission of 10 times as much data as the currently 
planned ground processing scheme allows. 
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One possible approach to high speed synthetic aperture radar signal 
reconstruction involves the utilization of two-dimensional real time spatial 
light modulators as recyclable replacements for photographic film in the input 
transducer plane of a modified SAR-coherent optical processor, as shown in 
figure 2. Leading candidate spatial light modulators include modified Pockels 
readout optical modulators (PROM), CCD-addressed liquid crystal light valves, 
and CCD-addressed membrane light modulators. The fundamental physical limita- 
tions affecting SAR-processor performance characteristics of such real time 
devices are under investigation. Current research on the PROM is focused 
on the effects of device operational mode, device constitutive parameters, 
electro-optic crystal orientation, writing wavelength, frame rate/data overwrite/ 
presuming, erasure completeness, and image retention on the overall quality of 
SAR image formation. Both modulated laser scanning and intensified CRT 
temporal -to-spatial input approaches are being examined. 

A spatial light modulator (SLM) that is commonly used in one-dimensional 
optical processors is the acousto-optic delay line. We are investigating the 
possibility of using acousto-optic devices (AOD's) as the input SLM in an 
optical SAR processor. AOD's cannot be used in a conventional SAR processor, 
however, since they can only introduce one-dimensional spatial modulation in the 
optical system. In the architecture we are investigating, the two-dimensional 
processing is accomplished utilizing one spatial dimension and one time 
dimension. Hence, a one-dimensional spatial modulator can be used. A 
schematic diagram of the processor is shown in figure 3. The backscattered 
signal for each transmitted pulse is applied to the AOD. The system is 
illuminated with a pulse light source in order to "freeze" the traveling 
acoustic wave in the AOD. The range compression operation is accomplished by 
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Figure 2.- Real time coherent synthetic aperture radar processor utilizing 
a two-dimensional spatial light modulator. 







Figure 3.- Real time coherent synthetic aperture radar processor utilizing 

a one-dimensional acousto-optic modulator and a CCD-array detector. 

the focusing action of lens L2- The photogenerated charge pattern stored in 
the CCD is shifted vertically by one pixel after each radar pulse. The signal 
that reaches the output stage of the CCD at each horizontal (range) location 
is the correlation of the range compressed signal and the transmittance of the 
mask placed immediately in front of the CCD detector. Azimuth compression is 
performed by making the transmittance of the mask a linear FM function in the 
vertical direction. 

Utilization of a two-dimensional spatial light modulator in a real time 
SAR processor allows for simplicity in image formation, and is, in addition, 
inherently flexible for handling different radar signal types. A disadvantage 
in this approach is the relatively recent development status of candidate 
spatial light modulators. The state-of-the-art of acousto-optic devices is, on 
the other hand, further advanced and represents a relatively mature technology. 
The limitations in the AOD/CCD approach are expected to be the dynamic range 
and number of elements of the CCD detector array. It is hoped that such 
optical signals in an integrated, compact, and rugged format. Many integrated 
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real time SAR processors will initiate new mission applications not presently 
envisioned due to current digital processor limitations. 
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OPTICAL IMPLEMENTATION OF SYSTOLIC ARRAY PROCESSING 

Jrf | I 

H. J. Caulfield , W. T. Rhodes , M. J. Foster > and Sam Horvitz 

* Innovative Optics , Inc., P.0. Box 1275, Concord, MA. 01742 
+ Georgia Institute of Technology, School of Electrical Engineering, 
Atlanta, GA. 30332 

-j- Carnegie-Mellon University, Computer Science Department, 
Pittsburgh, PA. 15213 

- Naval Underwater Systems Center, PA4 , New London, CT. 06320 


SUMMARY 


We show how certain algorithms for matrix-vector multiplication can be 
implemented using acousto-optic cells for multiplication and input data transfer and 
using CCD detector arrays for accumulation and output of the results. No 2-D matrix 
mask is required; matrix changes are implemented electronically. A system for 
multiplying a 50-component nonnegative-real vector by a 50 x 50 nonnegative-real 
matrix is described. Modifications for bipolar-real and comp lex- valued processing 
are possible, as are extensions to matrix-matrix multiplication and multiplication 
of a vector by multiple matrices. 


INTRODUCTION 


During the past several years, Kung and Leiserson at Carnegie-Mellon 
University (refs. 1,2) have developed a new type of computational architecture which 
they call "systolic array processing". Although there are numerous architectures for 
systolic array processing, a general feature is a flow of data through similar or 
identical arithmetic or logic units where fixed operations, such as multiples and 
adds, are performed. The data tend to flow in a pulsating manner, hence the name 
"systolic". Systolic array processors appear to offer certain design and speed 
advantages for VLSI implementation over previous calculational algorithms for such 
operations as matrix-vector multiplication, matrix-matrix multiplication, pattern 
recognition in context, and digital filtering. This paper grew out of our desire to 
explore the possibility of improving systolic array processors by using optical in- 
put and output. We will concentrate on describing the particular case of matrix- 
vector multiplication, but note that many other operations can be performed in an 
analogous manner. 


SYSTOLIC MULTIPLICATION OF A VECTOR BY A MATRIX 


The problem we address is that of evaluating a vector y given by 

y = Ax (1) 

where A is an n by n matrix, and x and y are n-component vectors. We assume that A 
has bandwidth w, i.e., all of its non-zero entries are clustered in a band of width 
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w around the major diagonal. Such matrices arise frequently in the solution of 
boundary value problems for ordinary differential equations. A systolic array that 
solves this problem is introduced by. Kung and Leiserson (ref. 1,2) and will be 
reviewed briefly here. 

A systolic array for multiplying a matrix of bandwidth w by a vector of 
arbitrary length has inner-product cells. The array for bandwidth 4 is shown in 
figure 1. Each of the four heavy boxes represents an inner-product cell, capable of 
updating the vector component y^ according to the replacement 


y. •** y. + a. ,x. 
i x ij ] 


( 2 ) 


The cells act together at discrete time intervals, or beats, with half of the cells 
active on each beat. The elements of the matrix A are input from the right, and the 
vector x is input from the top. Zeroes are input from the bottom, and accumulate 
terms of the vector y as they move upward. 

Figure 2 traces the action of the array for several beats, or pulsations, show- 
ing the terms of A and % and the partial terms of y that are in each cell on each 
pulsation. Thus on pulsation 1, y^ = 0 is entered. In pulsation 2, Xj is entered. 
In pulsation 3, vj becomes a^x^. In pulsation 4, yj becomes 

pulsation 5, Yj exits. Every other pulse another y^ exits and on that same pulse 
another yp. is inserted (at an initial value of zero) . 


OPTICAL SYSTOLIC ARRAY PROCESSING 


Key features of the systolic array approach to matrix-vector multiplication are 
(1) a regular, directed flow of data streams, (2) multiplication, and (3) addition or 
accumulation. These features are also characteristic of many optical signal process- 
ing systems, and it should come as no great surprise that optical implementations of 
systolic architectures are possible. Since both bulk and surface acoustic waves are 
routinely used in optical signal processing to produce a moving stream of data and 
for multiplication of data, it seems natural to use these components for optical 
systolic array processing. 

We choose as our example the simple matrix-vector multiplication 




a ll a i2 

CM 

i 


_ a 21 a 22 . 


(3) 


assuming initially that all quantities in this equation are real and nonnegative. 
The basic concept is illustrated with the help of figure 3. The system shown con- 
sists of an acoustooptic modulator illuminated by the collimated light from three 
LEDS, a Schlieren imaging system, and three detectors connected to a CCD analog 
shift register. At the moment illustrated in the figure, modulating signals pro- 
portional to x^ and X 2 have been input to the acousto-optic modulator driver, 
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producing short grating segments in the acousto-optic cell. As the x. grating 
segment passes in front of LED number 2 (the situation shown in the figure) , that LED 
is pulsed in proportion to matrix coefficient a-Q. The transmitted light, proport- 
ional in intensity to a is imaged onto CCD detector 2, which sends a proportional 
charge to an associated "bin" in the shift register. 

The x^ and x ^ grating segments now travel so as to be in front of LED’s 1 and 3, 
respectively. At the same time, the accumulated CCD charge from detector 2 is shifted 
one bin, in the direction indicated by the arrow labeled "output” in the figure. 

LED’s 1 and 3 are now pulsed in proportion to and a i2 5 respectively. Since these 

LED’s illuminate detectors 3 and 1 via grating segments and charge is generated 

by these detectors in proportion to &2l x l anc ^ a 12 x 2 5 respectively, and accumulated in 
the corresponding shift register bins. 

In the next increment of the system, charges are again shifted, with accumulated 
charge in proportion to a-QX-^ + &i2 K 2> or keing output. The charge packet now 
associated with detector 2 (already proportional to a^.x-.) is augmented by a final 
strobe of LED 2 by an amount proportional to &22 x 2' A final two shifts of the CCD 
charge packets bring charge proportional to a 21 x l * a 22 x 2> or y 2 s to out P ut > an ^ 
the operation is complete. 

The system illustrated is easily expanded to accommodate matrix-vector operations 
of higher dimensionality. If y and x are N-component vectors and A an N x N matrix, 
the maximum number of LED’s required is 2N-1 (the number of diagonals of the matrix), 
and the number can be smaller if A has a smaller bandwidth. 

Numerous variations of the system of figure 3 are possible. Figure 4, for 
example, shows the LED’ s replaced by a single light source and an array of modulators. 
The CCD shift register has been replaced by stationary detectors and integrators 
combined with a second acousto-optic cell, which serves to deflect light to the correct 
detector/integrator. The acousto-optic deflector approach to sorting output data may 
facilitate greater system dynamic range than is achievable with CCD detector arrays. 


BIPOLAR AND COMP LEX- VALUED COMPUTATIONS 


It was assumed in the preceding section that all elements of the matrix and in- 
put vectors were nonnegative-real. In practice, most matrix-vector multiplication 
operations of importance involve bipolar-real or complex-valued vectors and matrices, 
and some means must be employed for handling them. If the elements are real valued, 
but not necessarily nonnegative, a two-component decomposition scheme described in 
ref. 3 can be employed. For complex-valued processing, several schemes have been 
described (ref. 4). One of these involves a three-component decomposition of 
complex numbers according to ref. 5, 


z = Zq + exp [i27r/3] + exp £i4fr/3] , (4) 

where z^, z and z are nonnegative-real. Another involves biased real and imaginary 
components (ref. 6;. All such methods lead to some additional processor complexity 
and to a reduction in the size of the vectors and matrices that can be accommodated. 


55 



OPERATING PARAMETERS OF A TYPICAL SYSTEM 


Matrix size limitations are imposed by the acousto-optic modulator. Consider a 
system using for input a bulk acousto-optic cell with a 100 MHz bandwidth and a 10 
ysec time window. We estimate that such a cell should accommodate 100 LED/lenslet 
combinations operating side by side, allowing multiplication of a 50-component 
nonnegative -real vector by a 50 x 50 nonnegative-real matrix. Achievable dynamic 
range depends on CCD detector dynamic range and on the correction of LED and 
acousto-optic modulator nonlinearities; it is too speculative to suggest numbers at 
this time. Operating speed is determined by the amount of time it takes to shift 
the components of x through the acousto-optic cell, plus setup and final readout time. 
For the 10 ysec window cell under consideration, it takes 5 ysec to get the x-^ grat- 
ing segment to the middle of the acousto-optic cell, at which time the first LED 
pulse occurs. The last LED pulse occurs 10 ysec later, when x^q finally passes the 
midpoint of the cell. Following that pulse, an additional 50 ysec are required to 
read ycQ out of the shift register. The time required for the 50 x 50 matrix-vector 
multiplication is thus 10 ysec processing time and 10 ysec latency, for a total of 
20 ysec. During the processing interval, a total of 2500 multiplications are per- 
formed, at a rate of 2.5 x 108 multiplications per second. With suitable encoding 
of the data (refs. 3,4), this corresponds to a processing rate of 6.25 x 10 7 bipolar- 
real multiplications per second or 2.78 x 10' complex multiplications per second. 


VARIATIONS 


The system described does not exploit the two-dimensionality of the optical 
system. More than one matrix can multiply the same input vector at the same time if 
the single linear LED/lenslet and detector arrays are replaced with a collection of 
linear arrays, one above the other. Shear wave acousto-optic modulators, with nearly 
square window formats, can accommodate perhaps 20 such linear arrays, allowing 20 
separate matrices to multiply the same input vector at the same time. 

Matrix-matrix multiplication can be performed with related systems using 
multiple acousto-optic cells, or, alternatively, single cells with multiple driver/ 
transducers. Figure 5 shows one possible arrangement for multiplication of two 
2x2 nonnegative-real matrices. In general, for such a scheme, . multiplication of two 
N x N matrices requires two multi-transducer acousto-optic modulators with 2N-1 
transducers each. Alternatively, one such multi-transduGer cell could be used, 
illuminated by a 2-D array of N^-2 LED ! s. 
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Figure 1.- Systolic multiplication of vector x by banded matrix A. Traditional 

representation of this operation is shown in (a) . Basic cell for this operation 
is shown in (b) . Flow of x, y and A data is shown in (c) . 
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first seven pulsations of the processor of figure 1(c) are 
here and described in the text. 
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Figure 4.- An alternative optical implementation of the processor of figure 1(c). 
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Figure 5 . - Use of crossed acousto-optic cells to produce A x B_ = C_. Input 
information flow is shown in (a) , and calculated C values are produced 
as indicated in (b) . 
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ABSTRACT 


Optical Matrix Vector Multipliers (OMVM) are of interest to real time adaptive 
array beam forming. Feasibility of high speed compact signal processing for 
applications such as sidelobe cancelling has yet to be demonstrated. The 
importance of the OMVM stems from the following: 

(1) Because of its optical parallelism it can operate at extremely high 
speeds, surpassing by several orders of magnitude competing electronic 
techniques. 

(2) The matrix-vector multiplication operation is basic to a variety of 
important signal processing applications, including adaptive array 
processing and other general control operations. 

The very high speed of the OMVM strongly suggests its application to iterative 
processing, where in a very short time the system can cycle through perhaps 100 
iterations, and assuming necessary convergence, yield a solution. This method 
has been suggested as a means of inverting the covariance matrix associated 
with noise samples from adaptive array antenna elements. A review of the jammer 
problem for radars indicates that solution is required in the 1 ysec to 10 ysec 
range. The spatial light modulator must be updated within this time frame. 

We will describe here the optical architecture of an incoherent matrix vector 
multiplier where the matrix is complex and separable into outer product vectors. 
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INTRODUCTION 


In the late 1970’ s, researchers at Stanford University began investigation 
of a new class of incoherent electro-optical processors for high speed matrix- 
vector multipliers (ref. 1, 2). Work has continued, both at Stanford (ref. 3) 
and Carnegie Mellon Institute, to the point were significant processing 
capabilities have been demonstrated. It is not unreasonable to expect that 
such systems could perform 10 8 matrix-vector multiplies per second with 100x100 
element matrices. 

One limitation of these processors as developed thus far is the relatively 
fixed nature of the mask representing the matrix. in general photographic 
masks have been employed, and processing where the matrix must change with some 
rapidity has been impossible. 

In our search for a matrix mask that can be updated rapidly, we have considered 
several candidate spatial light modulators (ref. 4) (SLMs) , including the 
Itek PROM, and the Hughes LCLV. Neither the PROM nor the LCLV can be cycled 
at rates approaching 1 MHz, and these candidates are thus unacceptable. A mosaic 
of pockels cells could perhaps be made to operate in the desired fashion. 

However, there are the usual problems of switching high voltages in microseconds, 
and severe electronic crosstalk appears inevitable, we 

have settled on a scheme whereby the mask is implemented acousto-optically . 

Acousto-optic modulators possess several desirable features when used in signal 
processing systems. These include reliability, availability, high response 
speeds, and wide bandwidths. However, they are not easily adaptable to use as 2-D 
spatial light modulators. The basic technique and system architecture for a 
pair of 1-D SLMs is discussed in the following section. 


ACOUSTO-OPTIC IMPLEMENTATION OF VECTOR MULTIPLICATION BY OUTER PRODUCT MATRICES 


The operation to be implemented is: 


Y — H X (1) 

where X is a complex input column vector. 


H is a complex matrix separable into outer products: 

H = A B t 


( 2 ) 


The multiplication of separable matrices would be possible if the OMVM were a 
coherent processor. When the necessary complex to non-negative real encoding 
of H is implemented for an incoherent processor, the separable form of the system 
matrix is destroyed. However by using time multiplexing we can still avoid the 
requirement for 2-D masks. Consider one possible method. We can define 
expressions in which vectors and matrices are defined in the following form (ref 5,6). 
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(3a) 
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i9 i29 

Hq + + H 2 e 


(3b) 

(3c) 


where X Q , X , X 2 ; Y Q , Y.^ Y^ 1^, H 2 are all non-negative 

real and where 9 = 2 7T/3. The components X^, Y^ , Ih, i = 0,1,2 have the 
dimensions of X, Y, and H, respectively. 


The basic matrix vector multiplication operation can be written: 
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Consider the case where H is an outer product matrix (2) . If we express 

T "j 0 

(2) in terms of a three component decomposition , we have H = (A^ + A^e + (5) 

— i20. — t , ~t i0 , -t i20 t 

A 2 e ) (B 0 B i e B 2 e } 


Multiplying through and collecting terms in powers of exp (iQ) we find that 


So 

= A o 
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A 1 
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+ 

A 2 

B o 



To represent H by a three-component decomposition, we need nine outer 
products. 
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These nine outer products can be produced using a pair of crossed acousto - 
optic cells with multiple transducers. Figure 1 illustrates the case where 
A is a two-dimensional complex matrix. The desired outer products are 

formed at the intersection of the acoustic waves, in the following format: 
Let M = 


0 0 0 1 


M = 


A 1 B 0 A 1 B 1 


A 2 B 0 R 2 B 1 


A 0 B 2 


A 1 B 2 


(7) 


A^ B 
2 2 


At this stage we encounter a difficulty in system architecture: although_the 

various outer products of (3), together with input vector components X^, X^, 
X 2 , are sufficient to allow calculation of Y, these outer products are not 
arranged so as to allow direct implementation of equation (4) . The three- 
component form of matrix H 


H 


«0 


H„ 


h So 


H, 


s 2 


( 8 ) 


-2 


Si 


So 


is itself not representable as an outer product. As a 

consequence, it appears as though system architecture will of necessity 
be more complicated than would be the case were H directly realized by, 
for example, a more general 2-D SLM. 

Let us consider the problem further to gain a better understanding of its 
nature. We simplify notation by denoting the elements of equation (7) by 
M. . : 


Soo 

S 01 
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From equations (3) , (6) , (7) and (9) we have 
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Inspection shows that component Y^ can be expressed by 


Y = 
n 


I 

i, j 


M. .X. 
—ID k 


where i,j,k satisfy the relationship 

(i+j+k) mod 3 = n 


(10a) 


(10b) 


(10c) 


( 11 ) 
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Given the matrix M and input X = (X Q X^ X^) , it is not possible to generate 

all the terms of (10) by a single vector -matrix multiplication operation. 

It is, however, possible to generate Y with three vector -matrix multiplication 
operations, as follows. 

First, input the vector (X X and calculate its product with M. 

We write the resulting product as 


0a 


Y, = M X„ 
la ~2 


'2a 


This is followed by the operations 


'lb 


Y_ = M X_ 
2b “0 


'Ob 


X, 


'2c 


0c 


M X„ 


lc 


(12a) 


(12b) 


(12c) 


If the outputs of these three operations are properly combined, the result is 
the desired three -component representation for Y. Specifically, 


Y^ = Y^ + Y_. + Y_ 

0 0a Ob 0c 


(13) 


Y + Y + Y 

la lb lc 


Y 2 Y 2a + Y 2b + Y 2 c 
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Study of these equations shows that as the components X Q , , and are 

stepped cyclically in the input, the components of Y Q , , and Y^ are 

themselves stepped cyclically. One method for performing the summation of 
(12) and (13) is to use charge -coup led integrating detectors in the output 
plane of the optical system. The integrated charges can then be stepped 
synchronously with the input components. Another method is to use high 
speed analog switches and short delay lines for storage. 


PLANS FOR IMPLEMENTATION 


The basic optical layout of the iterative processor is shown schematically 
in figure 2. Light emitted from the sources is collimated into N parallel 
beams by means of a set lens Ll. Each lens is a segment of a spherical lens. 

Thus, each row of the first SLM is illuminated by a collimated beam from one 
of the light sources. The output beams from the AO Bragg cell, A, are focused 
by lens L4, spatially filtered by SI, and recollimated by L5. The lens 
set L4-L5 images the aperture of Bragg cell A at the second Bragg cell, B. 

The modulated beam from the second Bragg cell is spatially filtered and brought 
to focus on the linear detector array. The spatial fitlers are necessary 
only to remove the zero and second order beams. The output beams are modulated 
in intensity at NxN points within the optical aperture. This representation 
is equivalent to having both masks located at the same plane. The outputs 
from the light emitters are modulated^by vectors whose elements are X,. On 
multiplication, the output terms are Y. = V" A, B. X,. ^ 

1 L 3 1 3 

j 

When a non-negative real approach is implemented, the above scheme must be 
modified. The speed of the processor is slowed down by approximately a factor 
of three, since for each iteration of three successive multiplications must be 
carried out. After each multiplication the analog data must be stored while the 
input vector is shifted. The three successive outputs are added as in equation 
(18) . The output vector then becomes the input to the light emitters as in other 
iterative optical processors. 

The time multiplexed method described can be implemented using CCD shift registers. 
Unfortunately, state of the art CCD shift registers operating at the necessary 
speed lack the dynamic range and noise properties necessary for our applications. 

We have thus chosen to implement the previously mentioned procedure utilizing fast 
four channel CMOS analog multiplexer switches arranged in a switching matrix con- 
figuration. Referring to figure 3, as the MUX is clocked, its output is shifted 
along the linear laser digde arrays. Similarly, at the linear photodiode array, 
the outputs (Y^ Y^^ Y 2a^ trom the first input cycle (X^ X^ X^) 

are stored^_in coaxial cable delay lines. ^ The input X vector is then switched to 

(X X X^) and the output (Y Y^ Y^) is switched to the proper location 

and stored in another delay line. This procedure is repeated a third time, 
with the appropriate combinations taken to yield the desired outputs. 


67 



Some of the problems particular to the implementation involve non-linearities 
in the Bragg diffraction process, electrical and acoustic crosstalk, and limited 
dynamic range due to light scattering. 


CONCLUSIONS 


An analysis has been performed of the applicability of iterative optical 
processors in environments requiring a short transient response time. An 
approach has been proposed utilizing realizable acousto-optic single 
dimension light modulators where the matrix is the outer product of two vectors. 
The problems associated with a non-negative real representation have been 
addressed. A solution has been proposed albeit at some cost of processor 
speed. A proof of concept experiment is underway which will better determine 
the limitations and advantages of this method. 
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INTRODUCTION 

In recent years a growing interest has developed in incoherent optical processing 
(ref. 1). As the invention of the laser was an important force in coherent optics, 
the advent of fast and very compact solid state light sources is a major force be- 
hind this interest. 

Incoherent optical processing has indeed several appealing characteristics, 
namely higher immunity to optical noise and reduced dynamic range requirements than 
coherent processing as well as an inherent versatility of input format (serial or 
parallel) . 

A very fast fully parallel incoherent optical multiplier was developed at Stan- 
ford (refs. 2 and 3) and this device has been shown to be capable of performing com- 
plex-valued arithmetic on vector formatted data arriving in parallel to the processor 
inputs . 

There is a considerable number of applications for processors of this sort how- 
ever there is a need to extend their capability for two-dimensional parallel data. 
Indeed one can understand the interest in speeding real time processing capabili- 
ties in the context of reconnaissance sensors, two-dimensional beamforming, etc. 

This work addresses the latter case. We propose an extension of the concepts 
developed earlier to the new generation of matrix-matrix multipliers using incoher- 
ent light. This new technology has the potential of addressing problems involving 
two-dimensional mathematical transforms, two-dimensional pattern recognition, and 
high-speed processing of synthetic aperture radar data among others. 


SYSTEM ARCHITECTURE 

Let us then consider the matrix multiplication 

u Cj= 1 _A Jl B J O) 

where ^ is an M x K matrix, and JJj and JSj are K x N and M x N matrices respectively. 

The element c^ in the m-th row and n-th column is given in terms of the ele- 
ments g mk of ^ and b kn ofJJjby 


c 

mn 


K-l 


I 

u=r 


a b 
mk kn 


( 2 ) 


A matrix can, of course, be viewed as an ordered collection of column vectors. 
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If we define 
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the matrix product ^ = L A Jl B_ J becomes the sum of N matrices whose only non-zero 
elements are in its n-th column. In consequence, the matrix product can be seen 
as an ordered collection of column vectors each of which is the product of ^A^by the 
corresponding column vector of L B J A matrix-matrix multiplication can then be ex- 
pressed as a series of matrix-vector multiplications. The technology developed for 
the incoherent optical matrix-vector multiplied (ref. 2) can then be extended to 
this new application. 


The concept here is to combine N matrix-vector multipliers in a single proces- 
sor. The physical realization of this architecture is depicted in Figure 1. 

The elements of matrix are entered as irradiance values of a two- 
dimensional incoherent array that may be realized using narrow-band light-emitting 
diodes (LED), laser diodes, or even an array of polished optical fiber ends. The 
elements of matrix ^ are encoded as transmission values of an optical transparency 
or as reflected values of an optically reflective device. In either case, its im- 
plementation can be fixed in time (e.g., a film transparency) or it can vary with 
time (e.g., a real time optical modulator). 

The first block of optics (figure 1) images horizontally the input array ( B ,onto 
the optical mask [ A J and spreads vertically the light of each light source along the 
corresponding vertical column. Figure 2 illustrates the illumination produced by 
the central column of a 3 x 3 input matrix on a 3 x 3 mask. The light transmitted 
through the mask encodes all the possible inner products of the column vectors of 
the input matrix by all the row vectors of the stored matrix. 

The second block of optics (figure 1) integrates these inner products and 
focuses the light on the proper output array locations — imaging vertically and 
focusing horizontally onto the vertical column corresponding to the particular 


At this stage, we assume that the elements of ^ and are real and non- 
negative. Later, we will remove this restriction. 
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horizontal row at the input. This is a demanding operation that can be more clearly 
visualized in the case of a 3 x 3 matrix. We illustrate in figure 3 how we obtain 
the outputs (1, 1) and (3, 2). The output (3, 2) can readily be seen as the inner 
product of the 3-rd row of matrix^ and the 2-nd column of matrix as would be 
expected. Finally, figure 4 shows the light path through a particular cell in the 
stored mask. The element a^ (the central element in this example) maps the ele- 
ments of the k-th row of inputs onto the elements of the m-th row of output, C>, 


OPTICAL IMPLEMENTATION 


The optics block labeled "Optics I” in figure 1 has the same requirements as the 
mask illumination of the incoherent optical matrix-vector multiplier (ref. 3). In 
the latter case, a discrete set of classical discrete optical elements in one ver- 
sion and an array of multimode planar waveguides in another version were used. In 
the present case, a very similar form of the classical discrete optical components 
approach mentioned could certainly be used. 

The second block of optics — Optics II — has more demanding requirements. In 
effect, it must steer the light transmitted by each cell in the stored mask to all 
N elements in the corresponding row of the output detector array (figure 4). Holo- 
graphic optical elements can be used as the steering beam devices in this type of 
processor. An example of the application of this technology was shown by S. Case 
and his co-workers in the paper "Multifacet Holographic Optical Elements," presented 
at the Optical Society of America Annual Meeting in October 1980 (Abstract in 
J.O.S.A. , 12, 1980). 

In this context, we suggest the geometry depicted in figure 5 where each narrow- 
band light source and each cell of the stored mask are located behind a holographic 
optical element (HOE). This is a well established technology with the potential 
for a highly efficient light management. 


COMPLEX-VALUED ARITHMETIC 


It is, of course, important for a processor to have the capability to perform 
complex-valued operations. The incoherent optical processor we report here manip- 
ulates light intensities which are real- valued and non-negative in nature. Methods 
for encoding complex numbers similar to those used in the incoherent optical ma- 
trix-vector multiplier (ref. 4) can be applied here. 


The complex-valued arithmetic capability built in this processor is an immediate 
extension of the schemes utilized before, and for completeness, we will briefly 
elaborate two possible methods. 

A . Three Vector Decomposition 


Each complex quantity is decomposed uniquely along the three phasors 1 • exp [jO], 
l*exp [ j 2tt /3 ] and l*exp [ J 4 tt / 3 ] . There is more than one way to perform this decom- 
position (ref. 4, Appendix A) but we do not need to dwell on it here. 


73 



The three matrices can be decomposed according to 
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where the elements of the different matrix components are all real-valued non- 
negative quantities. 


The matrix -matrix multiplication , C ,= , A „B . can now be expressed in terms of these 
quantities as 
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from which the output complex-valued matrix jCj can be obtained after the proper re- 
combination of its real valued and non-negative components Cq^, and C 2 * 

The processor’s configuration is shown in figure 6, where we have omitted the 
optics. The size of the input and output arrays increases by a factor of 3 and the 
stored mask by a factor of 9. 


B. Biased Real and Imaginary Parts Deco mposition 

Another encoding scheme decomposes each complex quantity into its real and 
imaginary parts and adds to these a bias term. 


iAj an dJLi can then be decomposed into their real and imaginary parts. 
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(7) 


Assuming known the dynamic range of the elements of the real and 
of^andjk, we ass *-S n a bias matrix B^ to matrix^ and a bias matrix 


imaginary parts 
Bg to matrix 
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These bias matrices have constant elements. All elements in the same column are 
identical, but they can vary from column to column. 


If now we configure the system as it is shown in figure 7, the detected outputs 
Cj^ and Ci are given by 
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The real and imaginary parts and of the product ^ (given by equation 
(4)) can be obtained from equations (8a) and (8b) 
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(9b) 


In each equation, the first bracket is known a priori and it can be electron- 
ically represented by a fixed voltage. The second bracket can be measured, adding 
an extra transparent row to the stored mask. Hence, with proper and straightforward 
post-detection electronics, one obtains the desired product real and imaginary 
parts . 


SYSTEM PERFORMANCE ESTIMATE 


At this stage of the system development, we address ourselves only to the major 
performance parameter — the data throughput. 

We can reasonably postulate that it will be feasible to build matrix-matrix mul- 
tipliers with complex-valued matrices with sizes larger than 100 x 100. Without 
being overly optimistic, we can envision the electronics side of such processors 
being driven at clock rates greater than 100 MHz. This implies data throughput 
rates exceeding 10^ complex-valued samples per second. This is at least two 
degrees of magnitude higher than throughput rates obtained with matrix-vector 
multipliers. 


APPLICATIONS 


The parallel processing character of this processor is an essential feature that 
is extremely useful when a large number of input channels are present 
simultaneously . 

A large number of problems dealing with matrix multiplication can be implemented 
with this processor. We may refer problems involving Fourier transforms (beam 
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nulling, beamforming), image processing problems (pattern recognition, filtering), 
and signal identification (signature reconnaissance) , to name just a few. 

In this work, we will single out two fundamental mathematical applications that 
may be implemented using this technique. 

A. Two-Dimensional Transforms 


Let us consider a two-dimensional array i(k, 1) assuming values in the rectangu- 
lar range k = 0, 1, . . .,N-1, 1 = 0.1, . . . , N - 1. A two-dimensional trans- 
form over this array produces a second array o(m, n) given by 


o(m, n) 


N-l M-l 

t(m, n, k, 1) i(k, 1) 

k=0 1=0 



m = 0 , 1 , . . .,N-1 
n = 0, 1, . . . , M - 1 


where t(m, n, 
We assume 
t (m, 


k, 1) is the transform kernel. 

the kernel t(m, n, k, 1) is separable 

n, k, 1) = v(m, k) h(n, 1). 


( 10 ) 


( 11 ) 


We can then express o(m, n) as 
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( 12 ) 


and factorizing h(n, 1) and performing first the sum over k, we obtain 

N-l N-l 

o(m, n) = ^ h(n, 1)^ v(m, k)i(k, 1) 


(13) 
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The sum over k is a matrix product (see equation (2)) producing the intermediate 
result 


N-l 

s (m, 1) = ^ v(m, k) i(k , 1) (14) 

k=0 

which can be written in matrix notation 

JL'lLiLi (15) 

where is the input matrix,^ is the transform kernel matrix along the columns and 
Sj is the intermediate matrix. The output can then be expressed as 
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(16) 


o(m, n) 


M-l 

7 h (n , l)s(m, 1) 
1=0 


or in matrix notation 



where is the transform kernel matrix along the rows and ^ is the output matrix. 

Equation (17) shows that two matrix multiplications in sequence can implement a 
two-dimensional transform (as given by equation (10)) when the transform kernel is 
separable . 

The immediate application that comes to mind is the two-dimensional discrete 
Fourier transform (DFT) . The discrete Fourier transform kernel is given by 
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Figure 8 illustrates a basic configuration for a two-dimensional DFT implemented 
with two matrix-matrix multipliers. 

B . Iterative Processing 


Another significant application of incoherent optical matrix-matrix multipliers 
is in the area of iterative processing. Figure 9 shows a basic configuration. 

There are several possible uses for iterative techniques. Important examples 
are the phase retrieval problem related to imaging through turbulence (ref. 5) 
and the implementation of some numerical methods for matrix inversion. 

This work was performed under an ERIM internal research program. 
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2-D INCOHERENT MATRIX 2-D DETECTOR 
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Figure 1.- Basic block form of the incoherent optical matrix-matrix multiplier. 



Figure 2.- Example of to ^ illumination. For simplicity sake we will refer 
to matrix as the stored mask, though it may be variable with time. 
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Figure 5.- Matrix-matrix multiplier. Implementation with holographic 

optical elements. 



Figure 6.“- Three-vector decomposition configuration. 
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Figure 7.- Biased real and imaginary parts decomposition configuration. 
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One of the most significant developments in radio 
astronomy has been the recent discovery of over 50 
different molecules in the interstellar medium. These 
observations have changed our picture of the 
distribution of mass in the galaxy, altered our 
understanding of the process of star formation, and 
also opened up a new and lively field of interstellar 
chemistry. This achievement was made possible not only 
by the development of sensitive heterodyne receivers 
(front-end) in the centimeter and millimeter range, but 
also by the construction of sensitive RF spectrometers 
(back-end) which enabled the spectral lines of 
molecules to be detected and identified. 

Traditionally, spectrometers have been constructed as 
banks of discrete adjacently tuned RF filters or as 
digital auto-correlators. However, a new technique 
combining acoustic bending of a collimated coherent 
light beam by a Bragg cell followed by detection by a 
sensitive array of pho tod e tec tors (thus forming an rf 
acousto-optic spectrometer (AOS)) (promises to have 
distinct advantages over older spectrometer technology. 
An AOS has wide bandwidth, large number of channels, and 
high resolution, and is compact, lightwaight, and 
energy efficient. These factors become very important 
as heterodyne receivers are developed for ever higher 
frequencies. 
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The thrust of receiver development is towards high frequency 
heterodyne systems, particularly in the millimeter, submillimeter, 
far infrared, and 10 ym spectral ranges. The motivation for this 
development comes from the need to determine the chemical 
composition of the interstellar medium and to achieve a thorough 
understanding of the. excitation of interstellar molecules. The 
most important constituent of the interstellar medium after 
molecular hydrogen is carbon monoxide (CO) which has its 
rotational transitions at 115.3, 230.5, 345. B, and 591 GHz, etc., 
while hydroxyl (OH) , an equally important constituent, has its 
ground-state rotational transition at 250B GHz. In general, the 
lighter the molecule, the higher its ground-state transitions 
which fall often above currently accessible receiver ranges. 

The LO, LO coupler, and mixer present difficult challenges as 
the range of the receiver is extended towards higher frequencies. 
Klystrons which are already expensive and short-lived in the 3 
millimeter range become impractical at shorter wavelengths forcing 
development of frequency doubling or tripling mixers utilizing 
cheaper and more robust lower frequency LQs. In the 10 mm band the 
CO^ gas laser-heterodyne system, with its various isotopic 
mixtures, can cover only about 10% of the spectral band due to the 
limited tunability of the gas laser transition and limited 
bandwidth response of mixers. Work is presently underway in this 
laboratory to construct a diode heterodyne system which will have 
a 2 mm continuously tunable range. RF tuned cavity LO couplers 
which work well in the 3 millimeter wavelength become highly lossy 
in the millimeter range forcing developnent of quasi-optical 
techniques to combine signal and LO efficiently to the mixer. 

Mixer development is also continuing to extend usable range of 
mixers into the far infrared and 10 mm bands. Recently we have 
used a 25 watt CG^ 10 mm laser to pump a formic acid 430 mm laser 
in an experiment on the 3-meter IRTF in Maun a Kea to detect and 
map the 591 GHz, J = 6 -»• 5 transition of CO in Orion and other 
galactic sources. 

Common to all heterodyne systems is an iRF spectrometer 
(back end) which recovers the power spectrum of the signal frcm the ( 
IF. Traditionally the spectrometer is a bank of discrete 
adjacently tuned RF filters or a digital auto-correlator. Most 
radio telescopes have 255 or 512 channels at 1 MHz or 0.25 MHz 
resolution. Heterodyne receivers in the millimeter , 
submillimeter , or far infrared wall be operated from remote 
mountain, airborne, balloon-borne, or spaceborne platforms to 
avoid the severe atmospheric attenuation in the 20 mm to 1 mm 
range. This will place severe demands of size, weight, and energy 
usage on the back end which RF filters may not be able to satisfy. 

Recent developments in acousto-optic techniques and in 
photodetector arrays have made feasible a new type of RF 
spectrometer offering the advantages of wide bandwidth, high 
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resolution, large number of channels in compact, lightweight, 
energy efficient, and relatively low cost systems. Such a system 
employs an acousto-optic diffraction cell which serves the key 
role of converting RF signals to ultrasonic traveling waves 
modulating the optical index of the cell. The cell is illuminated 
across its aperture by a monochromatic laser beam. A fraction of 
the light is diffracted by the acoustic waves; the angle of 
diffraction is determined by the frequency while the intensity of 
the diffracted light is proportional to the power of the input RF 
signal. (The major portion of the laser beam at zero order passes 
through the cell undeflected.) A focusing lens follows the cell 
and essentially performs a Fburier transform of the RF signal into 
a far-field intensity pattern. The output intensity distribution 
is typically received by a linear array of photodetectors whose 
output is the RF power spectrun vk seek. The advantage of an AOS 
is due to the simplicity arising from the small number of 
components needed to build up the system. 

Currently there is intense commercial interest in applying 
acousto-optic techniques to electronic warfare (EJ a /) , electronic 
counter measures (ECM) and electronic support systems (ESM) which 
is spurring rapid technical advancements in the field. The Air 
Force and Navy, in a joint effort, are funding research at 
GTE-Sylvania , ATI-Itek, ESL, Teledyne-MEC, and Rockwell 
International in bulk Bragg cell AOS components. One benefit from 
this research is the commercial availability of 1 GHz bandwidth, 1 
MHz resolution Bragg cells. The Air Force and the Navy have also 
funded Hughes Research and kfestinghouse in an effort to further 
miniaturize an acousto-optic system by integrating all its 
components on a single substrate. These recent developments have 
made acousto-optic techniques increasingly attractive for use in 
astronomical applications. 

We are currently both experimenting with a bulk and awaiting 
delivery of an integrated AOS device for astronomical research 
here at Goddard. The GSFC prototype AOS uses a discrete bulk 
acoustic wave Itek/Applied Technology Bragg cell with 300 MHz 
bandwidth (specified) 0.57 MHz resolution (500 resolution 
elements) , 5 rnW Spectraphysics Model 120 helium-neon laser with 
aluminum optical bench components assembled from Klinger 
Scientific. The length of the stainless-steel rods is 1 meter, 
and although the optical layout is fairly compact, no attempt was 
made to optimize the unit for size. The laser is located on the 
bottom of a two tier construction, with the beam directed to the 
upper level by guiding mirrors. A beam expander (16X) is placed in 
front of the Itek Bragg cell with the cell mounted on X-Z position 
translators. An aberration-minimized biconvex lens 80 mm in 
diameter with a 47 an focal length follows the Bragg cell. The 
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diffracted light is then guided downward by a mirror flat, and the 
output light is detected by a 1024 element CCPD Reticon array 
mounted on precision rotation and translation stages. Hie 
interlocking construction of the Klinger assembly gives the system 
very good rigidity and once the optical path has been aligned, 
retains its alignment even after movement of the assembly as a 
unit. Although no attempt has been made to temperature control 
the assembly the AOS appears to drift less than 1 channel over a 
24-hour period. 

The Itek/ Applied Technology Bragg cell is made from a 1 cm 
optical aperture LiNbO crystal with a specified 300 MHz bandwidth 
centered at 450 MHz. The interaction time is 1.5 ms with a 
time-bandwidth factor of 450. The diffraction efficiency of the 
cell is reported to be 7%/vratt of Rlj' power. Preliminary results 
of the GSFC AOS are given elsewhere 1 . 

An exciting prospect is the use of integrated optics in 
further miniaturizing an acousto-optic spectrometer. Recently, 
Westinghouse, under NRL contract and GSFC participation, hasj 
constructed a 400 MHz bandwidth, 140 element integrated AOS . 

This entire RF spectrometer has dimensions of 7X3 cm. The] 
spectrum analyzer, shown schematically Fig. 1, consists of a 
laser, a diffraction-limited geodesic collimating lens, a surface 
acoustic wave (SAW) transducer array, a second diffraction-limited 
geodesic lens that is utilized as a transform lens, and a 140 
element photodiode array. A Te polarized 100-mW He-Ne laser 
operating at 0.6328 mm was end- fire coupled into the spectrum 
analyzer for preliminary testing. Efforts are now underway by 
Westinghouse to butt couple a diode laser source to the substrate. 

The spectrum analyzer was fabricated on X-cut LiNbCL with the 
c axis parallel to the acoustic propagation. The optical 
waveguide was formed by diffusing 180 A of titanium into LiNbO_ to 
obtain a tightly confined optical beam. The photosensor array'was 
butt coupled at a 45 angle to the waveguide edge of the LiNbO^ 
substrate in an optically polished Ger-Vit mounting block. Tills 
angular mounting minimizes spurious signals due to reflected light 
from the surface of the photodiode array and waveguide substrate. 

The laser beam is coupled into the waveguide and collimated 
by the first geodesic lens. The SAW spatially modulates the 
guided 'optical beam. The deflected beam is focused onto the output 
edge of the optical waveguide. The butt-coupled photodiode array 
then detects the deflected beam. The deflection angle is 
proportional to the frequency of the RF signal. Therefore, the 
frequency of an incoming signal can be determined from thel 
position of the focused beam on the detector array. 
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The geodesic lenses are circularly symmetric aspherical 
depressions in the surface of the waveguide and are fabricated by 
single-point diamond turning. The advantages of using 
single-point, diamond turning for fabricating geodesic lenses are 
numerous. First, the lens can be precisely located on the 
substrate, which allows the focal plane of the lens to be placed 
at a predetermined position. Second, the lens profile can be cut 
to tolerances of better than 0.5 mm, which gives 
diffraction-limited performance and predetermined focal lengths. 
Finally, surfaces of the lenses can be cut very smoothly, thus 
eliminating the need for much post-machining surface polishing. kfe 
have measured the performance of diamond machined lenses and have 
found them to be diffraction-limited with an insertion loss of 2 
dB, and to have a focal plane which is within 12 mm of the 
previously polished waveguide edge. 

The detector array is a self- scanned photodiode array 
consisting of 150 photodiode pixels with a 12-mm pitch. The 
maximum shift register clockrate is 5 MHz, resulting in an access 
time of 2 msec. The maximun measured output voltage was 1.0 volt 
while the minimum measured output voltage was 90 mvolt, which 
yields a dynamic range of 40 dB. The next nearest neighbor 
crosstalk was down by 15 dB. A single zeroth-order bean sensing 
photodiode is included on the 6.4-mm square chip. The array is 
divided into 7 pixel groups of 20 pixels each. Each group is 
addressed in parallel by a 10-stage dynamic PMOS shift register. 
Each group has two electrometer output circuit channels; one 
handles odd pixels, while the other handles the even pixels for 
each clock cycle of the PMOS shift register. This scheme provides 
fourteen parallel samples every clock cycle. 

The surface acoustic wave transducers are a two-element 
tilted array designed to have a combined 400-MHz bandwidth 
centered at 600 MHz. The operation of the spectrum analyzer and 
the frequency range over which the transducers operate are shown 
in Fig. 2. A separate measurement of the transducer performance 
provided 3-dB bandwidth of 400 MHz and a 5% deflection efficiency 
with an RF power of 60 mW. 

In summary, this laboratory is in the process of constructing 
and evaluating bulk AOS devices with 300 MHz and 1 GHz bandwidths 
for use in the back end of high frequency heterodyne receivers for 
use in astronomical research. In addition, we are in the process 
of obtaining and will evaluate the feasibility of using integrated 
AOS devices as potential back ends for future balloon-borne or 
space experiments using heterodyne receivers. 
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Figure 2.- Deflection of optical guided wave as a function of 
surface acoustic wave frequency. 
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SUMMARY 


An automatic digital correlation plane processor is described that permits rapid 
location and identification of up to 99 different normalized autocorrelations. Up to 
99 unnormalized correlations are acquired in a single TV frame of 1/30 sec. Analog 
preprocessing circuits permit digital conversion and RAM storage of those video 
signals with the correct amplitude, pulse width, rising slope, and falling slope. TV 
synchronized addressing of 3 RAMs permits on-line storage of: 1) the maximum 
unnormalized amplitude, 2) the image x location, and 3) the image y location of the 
output of each of up to 99 matched filters. A fourth RAM stores all normalized 
correlations. 

Only the maximum correlation for each of 99 matched filters is determined in one 
TV frame of 1/30 sec. Successive TV frames look for the next largest correlation 
output for each optical matched filter. 

The silhouette of each image stored in the multiple optical matched filter is 
also stored in a pair of ROMs that represent the left and right side of the 
silhouette. Input image energy is collected through an optical high-pass filter of 
the same bandwidth as the corresponding optical matched filter by a gated, wideband 
video integrator. The image energy signal is routed to a ROM reciprocal look-up 
table. Normalization (division by image energy) is accomplished in 1 ysec by a 
dedicated digital multiplier. A partial out-of-field signal is available to abort 
normalization and to indicate the approach of loss of track. 


INTRODUCTION 


The optical matched filter (MF) (reference 1) has not found wide application 
because acceptable false alarm rates have not been realized by observation of the 
amplitude of the correlation peaks. When the illumination level associated with an 
unmatched input image greatly exceeds that of the matched image of comparable size, 
the MF generally yields a false alarm if MF output correlation peak is the only 
measure of acceptance. However, if we could divide the above autocorrelation and 
cross-correlation output peaks each by their corresponding input image energies-, the 
resulting normalized correlations would then be independent of input image energy, 
and for this situation, the normalized autocorrelation peak would be larger. 

The normalization approach described below does normalize the autocorrelation 
peak but yields an approximate normalization of the cross correlation peak. We, 
therefore, expect the normalized autocorrelation not to always be larger, but to 
always be unity, whereas the approximate normalized cross correlation will rarely be 
unity. The deviation from unity will be used as a measure of the uncertainty of a 
correct image match. To assist in reducing false alarms, other discriminants such as 
unnormalized correlation pulse width, rising and falling slopes, as well as 
unnormalized pulse amplitude are an integral part of the system. 
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PHENOMENOLOGIC MATCHED FILTER REVIEW 


Because we (1) exploit the automatic tracking feature of the MF, (2) use 
multiple MFs, and (3) employ band-limited MF's to normalize the correlation outputs, 
a brief description of the MF is in order* 

Figure 1 is one of the many possible MF configurations . ■ The back focal plane 
of the transform lens contains the on-axis, two-dimensional Fourier transform G(f x , 
fy) of g(x,y), the amplitude of the matched input image* When we signify the input 
image we refer to the image of the object of interest in the input scene* During 
fabrication it will be the only image in the scene* The spectrum G(f x , fy) is 
complex and two-dimensional and its phase can be recorded on high resolution spectro- 
scopic film by use of a second phase-locked collimated laser beam at angle 0 to the 
signal beam* The resulting stationary interference pattern shown magnified in figure 
1 contains a grating, or "subcarrier , n whose spacing depends on the angle between 
the signal and reference beams* Superimposed on the subcarrier grating are the very 
slight perturbations of the phase contributions of the Fourier transform of the input 
image* On playback with the reference beam removed and with the exposed and 
developed MF inserted in the identical fabrication position, the conjugate first 
order of the MF subcarrier is used as shown in figure 1* In figure 2 we fold the 
arrangement of figure 1 in line to demonstrate several points. First, for the 
matched input image the light exiting the MF is G(f x , fy) G*(f x , fy), where 
G(f x , fy) is the amplitude of the input matched spectrum and G*(f x , fy) is 
the amplitude transmittance of the matched filter. Therefore, all rays exiting the 
MF, regardless of their input direction (phase), are parallel* The consequence of 
this fact is that they focus to a point at the correlation output plane. The focused 
point in the correlation plane is the summation of all spatial frequencies and, 
therefore, all the input image energy less system transmission losses. This is not 
true for an unmatched input image and the cross-correlation peak will be reduced as 
well as broadened. For a band-limited MF with the low spatial frequencies shown in 
the magnified MF of figure 1 saturated (overexposed) , the autocorrelation function 
becomes a sharp point, as shown in figure 3(b) and compared to that of figure 3(a) for 
the full spectrum MF. In practice, the photographic MF plate has a dynamic range of 
the order of 1000:1 and the energy spectrum of most objects has a dynamic range of the 
order of lO^il, so that the full spectrum autocorrelation of figure 3(a) is not real- 
izable. Second, several MFs can be processed simultaneously through the use of a 
multiple holographic lens array that is equivalent to several overlapping lenses as 
shown in figure 2(b). Third, figure 2(a) indicates a transverse displacement of the 
input image, and, for F 0 = Fj_, the correlation will experience the same displacement 
with an inversion in the x and y directions. Therefore, input image position in the 
input plane can be determined by observing the position of the correlation output. 

By making F]_ < F Q , the mapping of the input image plane into the output correlation 
plane can be reduced to fit the output of several filters into the TV camera tube 
viewing aperture. 


NORMALIZATION APPROACH 


Parseval's theorem equates image integrated energy to its spectral 


f 
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integrated energy* The left side of the equation, image energy, is measured from the 
input image and provides the denominator for the normalized correlation* The right 
side, spectral energy, is measured by the autocorrelation peak to provide the 
numerator* Provision is incorporated to collect image energy in the same restricted 
band of the M F, Image energy band limiting is accomplished by selecting one of the 
100 holographic lenses for imaging* Instead of a MF at this beam we provide a high- 
pass or bandpass imaging filter of the same bandwidth as the MF, in the MF plane* 

The reference beam, on fabrication, is used to direct the band limited image on 
playback to an imaging lens to provide a different image plane scaling and a 
different TV pickup other than that used for the correlation plane* In a feasibility 
experiment now being assembled, a 2 x 2 holo-lens will provide three MF and one 
imaging lens that will share the same inverse transform lens for MF, as well as 
imaging, and will share the same TV camera for correlation and imaging* 

To collect the image energy, the TV video is integrated only during the time the 
desired image is scanned* Detectors are square-law devices, and the TV video voltage 
is proportional to energy* An earlier version of this concept restricted video 
integration to a rectangle that was centered on the desired image* The x-y RAMs 
provide the integrating window position over the image* The particular MF, as 
determined by the RAM address, determines the size of the rectangle* In this manner 
the matched image will always fall within the rectangle window even though an 
unmatched image may fall partially outside* However, since the input image and its 
background are a high frequency version of the actual input scene, we can expect a 
non-negligible contribution by the background around the matched image within the 
rectangle* To avoid this situation, and to guarantee that the normalization of the 
autocorrelation will be accurate and free of background, the shape of the rectangle 
was altered to that of the silhouette of the matched image* 

Two ROMs are used to store the silhouettes of all matched images* We will refer 
to one as the START ROM and the other as the STOP ROM* The silhouette in each 
interlaced field of a TV frame is located at contiguous addresses in the START and 
STOP ROMs. For silhouettes where a TV scan line leaves and reenters, an additional 
pair of ROMs will be required to avoid integrating background energy* A TV frame of 
1/30 sec is required to collect the image energy of each selected unnormalized 
correlation. Preprocessing circuitry limits the number of unnormalized correlations 
to be normalized® 


APPLICATION 


The need for normalized correlations can be appreciated if we have to select or 
detect a possible autocorrelation out of thousands of correlations in a few seconds® 
This was the motivation for the development of the system to be described® 

The OMFIC (Optical Matched Filter Image Correlator) is described in reference 2 
and is presented in schematic form in figure 4. The system is capable of processing 
an input image through 100 optical matched filters simultaneously. Therefore, an 
image of an input scene containing 20 objects will yield 2,000 correlations in 2 or 
3 ns at the OMFIC output plane. The output correlation plane is scanned by a 
conventional TV camera and all 2,000 correlations can be viewed on a TV monitor. 


95 



The problem addressed was to select any autocorrelations out of the array of 
correlations, and identify the image (associated matched filter) and the image x~y 
coordinates in a few seconds. Correlation normalization was one of the discrimina- 
tors used to reduce false alarms. If the matched images were trucks parked side by 
side, the resulting autocorrelations could be a few microseconds apart, and this 
ruled out the use of microprocessors to process the TV video. TTL with 20 ns 
switching time are adequate and, therefore, the bulk of the system described below 
consists of integrated TTL circuits. A yP is used to provide slow information in a 
few 100 ys time frame and to supervise overall operations, conduct searches, and 
provide displays and printout messages. 


SYSTEM DESCRIPTION 


Figure 5 is an overall system block diagram. The TV video is preprocessed so 
that only video pulses of the correct amplitude, of the correct pulse width, and with 
the correct rising and falling slopes will be accepted to be converted to 8-bit 
digital data in a 600 ns A/D converter. Figure 6 shows the preprocessing gate 
timing. The unnormalized correlations are stored in a 256 x 8 bit RAM. There are 
three additional RAMs, all 256 x 8 bits, and two of these are synchronously addressed 
with the TV raster sweep, along with the unnormalized correlation RAM as described 
below. Two of the RAMs store image x-y coordinates, and the fourth RAM stores the 
normalized correlations. 

Two timing counters accurately address the RAMs and locate the position of all 
correlations to the required precision on the TV raster. The horizontal counter 
consists of a digital phase-locked-loop (PLL) and provides the four least significant 
bits (LSB) for the RAM 8-bit address bus. The vertical counter is essentially a TV 
line counter providing the four most significant bits (MSB) for the RAM address. The 
H and V counts can be hardware or software altered to agree with the number and 
configuration of the matched filter array. A test pattern can be displayed on a TV 
monitor for system setup and assistance in matched filter fabrication output 
addressing. Such a pattern is shown in figure 7 for a 10 x 10 matched filter array. 
Each of the 100 boxes in the checkerboard test pattern represents a RAM address. The 
bright spot in the upper left-hand corner of each box (more visible in figure 3(b)) is 
the RAM erasure pulse of 100 ns duration of every other TV field. Since the RAMs are 
synchronously addressed with the TV raster, the data at any one address can be 
displayed on the TV monitor. The MSB of the data bus was connected to the TV monitor 
video input to demonstrate that the correlation peak was stored at the correct 
address. To ensure that only the peak of the video pulse is stored in each address, 
an 8-bit digital comparator tests the incoming 8-bit video peak with the RAM stored 
value and will dump the stored value if it is smaller. In this manner, only one 
correlation peak is stored in each address. If several correlation pulses are 
present for a particular MF (in the same box of the checkerboard), the stored value 
will be inhibited by the yP on the next TV frame after normalization. The next 
largest correlation peak of the particular filter will then be stored. After several 
TV frames all correlation peaks associated with each matched flter will be processed. 
The 8-bit comparator output pulse triggers a write pulse for the unnormalized and the 
x and y RAMs. Therefore, the maximum unnormalized correlation peak of each filter is 
stored in a RAM address assigned to that filter and the x-y coordinates associated 
with the correlation are stored at the same address in their respective x and y RAMs. 
At the end of each TV frame the RAMs are erased and the process is repeated. 
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During every other TV vertical blanking interval (once per TV frame) of 
approximately 1*7 ms all three RAMs (unnormalized correlation, x and y coordinates), 
together with the normalized correlation RAM, are scanned very rapidly by switching 
in a high-speed addressing counter* At each address the contents of the unnormalized 
and the normormalized RAMs are interrogated for any content* If no normalized number 
is stored where an unnormalized number has been found, then the normalization process 
will be initiated during the next TV frame, and the first step in the normalization 
process is the storage of 1 ) the RAM address, 2) x count, 3) y count, and 4) the 
unnormalized correlation* 

In figure 8, we define the image height and location, referenced to its 
location, as stored in the x and y RAMs* The left shaded side of the image, 
designated SR, is stored in the START ROM at contiguous addresses for successive TV 
lines in a field, and the interlaced field is stored at another group of contiguous 
addresses. The same image is stored at the same addresses in the STOP ROM for the 
right edge of the image, designated SP* When the horizontal sweep and, therefore, 
the x counter of the timing chain is equal to the value stored in the START ROM, the 
associated comparator output will switch on a high-speed, wideband video integrator, 
triggered by SR, and, in a similar manner, the STOP ROM will switch the integrator 
off, triggered by SP* The height, TL + BL, of the image triggers a height FF to gate 
in only the correct SR and SP ROM addresses* To correctly address the START and STOP 
ROMs for: a) the correct image, b) correct field, and c) correct positon in x and y 
requires offsets on all position data* Some of these are slow (TV field intervals) 
and are provided by the microprocessor* The 8-bit counts that are required in less 
than a TV line interval of 63*5 ys are hard wired* 

The RAM address identifies the MF and, hence, the image. The location of this 
image, in the START and STOP ROM, must be accessed as the TV sweeps through the 
image* The value Y 0 - T^ of figure 8 defines the image top line, and the yP 
provides the offset for the corresponding ROM location of this image. In this 
manner, the vertical timing chain line-counter is offset to correctly address the 
ROMs as the image is scanned* At the beginning of each TV scan line, the ROM stored 
value is subtracted from the X 0 stored value to provide a count for the leading 
edge of the image (X Q - SR). An 8-bit comparison of this value with the horizontal 
PLL timing chain identifies the left edge of the image to start the video integrator* 
In a similar manner (X Q + SP) will stop the integrator for that particular TV line. 
After a complete TV frame, the integrator output is converted to 8 bits and a 
reciprocal ROM provides the denominator to a fast (1 ys) 8 bit by 8 bit multiplier 
with the unnormalized correlation. Storage of this value in the normal correlation 
RAM completes the normalization cycle* During the TV frame that normalization is 
underway, the X-Y RAMs, as well as the unnormalized RAM, continue to be updated as 
before. The yP reads the normalized RAM and checks for deviation from unity. An 
appropriate video overlay circle or square can encircle the correlation peak, and the 
blinking rate of the overlay can indicate deviation from unity* 

The block diagram of figure 9 indicates how the operations described above are 
realized in a straightforward fashion* To determine if the image is partially cut 
off by the raster boarder, we evaluate four quantities. If (Y Q - T L ) < 0, then 
the image is partially off the top of the screen. Similarly, if (Y Q + B^) > 

244/k (where R is the number of rows of MF, and, therefore, correlation address 
rows), the image is partially off the bottom of the screen. If (X Q - SR) < 0, then 
the image is partially off the left side of the raster, and if (X Q + SP) > 100 (for 
a 100 count raster), then the image is partially off the right side of the screen. 

In all four of these conditions, the normalization will be aborted and an appropriate 
message will be indicated by the yP* This same information could be used for an 
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automatic tracker associated with the OMFIC that is about to lose its target and the 
direction to move to maintain the track is indicated® 


NORMALIZATION OF CROSS CORRELATIONS 


We can apply Parseval ! s theorem to the cross correlation situation 


/ / 


y x 


g 2 (x,y)g 1 (x,y)dxdy = / p / p 
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where g 2 (x,y) is an unmatched input image amplitude and g^(x,y) is the matched 
image* However, we do not have a single peak in the correlation plane that 
represents the integrated cross product because G 2 and do not have conjugate 
spectra, and the spectra do not have the same spatial frequencies* A cross 
correlation peak need not be an even function (as it is for an autocorrelation peak), 
and its location can be located almost anywhere within the input image if we map it 
back to the input plane* The ROM stored silhouette location is positioned by the x-y 
RAM stored correlation peak as indicated in figure 8* There are three different 
situations that we can encounter: 1) the extent of g 2 > the extent of g^, 2) the 
extent of g 2 < the extent of g^, and 3) g^ and g 2 do not completely overlap* 

In case (1) where the silhouette g^ falls entirely within g 2 , the input image, we 
do collect less than the full input image energy, and the denomination of the 
normalization is smaller than for a perfect g 2 silhouette® It is, therefore, 
possible to have a normalized cross correlation greater than unity® In case (2) we 
can have a smaller than unity normalization; however, we do collect background, and 
this can alter the normalization® The same is true for case (3)* 

Therefore, although we can make a definitive statement that the normalized 
autocorrelation is unity, we can expect the normalized cross correlation to vary 
considerably with some slight possibility of a value of unity® 

No effort was made to exploit the even symmetry of the autocorrelation peak in 
the preprocessor® If required, this extra feature can be implemented in a 
s tra ighf orward manner® 


CONCLUSIONS 


The automatic correlation plane processor described here can rapidly acquire, 
identify, and locate the autocorrelation outputs of a bank of multiple optical 
matched filters* The ROM stored digital silhouette of each image associated with 
each matched filter allows TV video to be used to collect image energy to provide 
accurate normalization of autocorrelations® The resulting normalized autocor” 
relations are, therefore, independent of the illumination of the matched input® 
Deviation from unity of a normalized correlation can be used as a confidence measure 
of correct image identification® 
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Figure 1,- Hatched filter configuration. 
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Figure 2.- Some aspects of matched filters 




Figure 3.- Wide band and high frequency autocorrelation functions. 




Figure 5.- Overall system block diagram. 
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Figure 7.- Ten by ten test pattern. 



X Q - x COORDINATE OF THE CORRELATION STORED IN X-RAM 

Y 0 = y COORDINATE OF THE CORRELATION STORED IN Y— RAM 

TL = TOP LINE OF IMAGE 

BL = BOTTOM LINE OF IMAGE 

SR = 8-BIT NUMBER STORED IN START ROM 

SP = 8-BIT NUMBER STORED IN STOP ROM 


Figure 8.- Definition of terms. 
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ABSTRACT 


An iterative optical vector-matrix multiplier with a microprocessor-controlled 
feedback loop is capable of performing a wealth of diverse operations. In this paper, 
we survey and describe many of these operations to demonstrate the versatility and 
flexibility of this class of optical processor and its use in diverse applications. 


1 . INTRODUCTION 


The optical vector-matrix multiplier [1] is a general purpose optical processor. 
The addition of a microprocessor-controlled feedback loop results in an even more 
general purpose and far more powerful optical processor [2-6], In this paper, we sur- 
vey and describe a selected set of the operations achievable on such a processor. 

In Section 2, a general description of the system is advanced. This is followed 
in Section 3 by a description of how bipolar and complex-valued data are handled on 
the system and how the convergence of the iterative algorithm is insured. We then 
address in Section 4 its use in the solution of linear difference and differential 
equations and linear algebraic equations. In Section 5, we consider application of 
this processor for the solution of the least-squares problem. In Section 6, we 
address its use for deconvolution and for the computation of the eigenvalues and 
eigenvectors of a matrix. In Section 7, our attention turns to the use of the system 
for matrix -matrix multiplication, the solution of linear matrix -matrix equations and 
matrix inversion. We then consider in Section 8 its use in the solution of nonlinear 
matrix equations with specific application to the linear-quadratic-regulator problem 
and algebraic Riccati equation of optimal control engineering. 


2. ITERATIVE OPTICAL PROCESSOR SYSTEM 


The general schematic of the ^iterative optical processor (IOP) is shown in Fig- 
ure 1. This figure illustrates the use of the IOP both as a vector-matrix multiplier 
and as an iterative processor for the solution of linear vector-matrix equations. 
Since an understanding of this architecture is paramount to the remainder of this 
paper, we review its operations. To multiply a vector by a matrix, the elements of 
the vector are used to spatially modulate a linear array of .light emitting dhLodes 
(LEDs) or laser diodes (LDs) at Pp. We denote the light distribution leaving Pp by 
the vector x. Plane Pp is imaged vertically onto Pp and the output from each LED is 
expanded to uniformly illuminate the corresponding row of a 2-D mask at P 2 . The light 


105 



distribution leaving each column of P 2 is then summed onto a different photodetector 
at P 3 , With the transmittance of P 2 specified by the matrix H, the vector output from 
the linear photodetector array at P 3 is the matrix-vector product H _x as described in 
[1] and earlier by others [7,8]. 

In our system [2,3], we have included feedback of the photodetector outputs to 
the LED inputs through an electronic feedback system. In our original description of 
this system [2,3], the mask was [ _I - H] , where I is the identity matrix. The P 3 out- 
put is then [_I - H]x(j), where x(j) denotes the vector x at the j-th iteration. An 
external vector x was added to this matrix-vector product in the electronic feedback 
system to form the new iterative input x(j + 1) • Our system thus implemented the 
iterative algorithm 


x ( j + 1) = [I - H]x( j ) + y = x( j ) - H x ( j ) + x* (1) 

In the steady-state when _x ( j + 1) - x(j), equation (1) reduces to II x- y and the out- 

put x is the solution 

x = H -1 y (2) 

of the vector-matrix equation 

H x = x* (3) 

In the newest [5,6] version of this IOP, we have: (1) used fiber optics to 

realize the required projection from P]_ onto P 2 (this greatly improves the system's 
accuracy as well as its mechanical and positional stability and reduces its size and 
weight); (2) placed the photodetector at P 3 in direct contact with the matrix mask at 
P 2 (this is possible when the 4 mm height of each photodetector is matched to the 
height of the matrix mask at P 2 . This further reduces the size and weight of the sys- 
tem and makes it even more stable for airborne applications) ; (3) included a micro- 
processor system with an arithmetic logic unit (ALU) , memory and hardwired multiplier 
in the electronic feedback loop (Figure 2), (this increases the system's flexibility 
and versatility); and (4) modified the feedback system and the iterative algorithm to 
incorporate an acceleration parameter to insure convergence of the iterative algorithm 
(Section 3) . 

When funding permits, we plan: (1) to increase the size of the P^_ input and the 

P 2 mask from its present 10 x 10 level; (2) to incorporate a real-time and reuseable 
spatial-light modulator electro-optical mask element (such as a CCD-addressed liquid 
crystal light valve [9]) into the system; and (3) to increase the repertoire of opera- 
tions and applications for the system. In Sections 4-8, we advance the first descrip- 
tion of a selected number of general operations and problems as well as applications 
for which this new optical processor can be used. 


3. CONVERGENCE AND HANDLING BIPOLAR AND COMP LEX- VALUED DATA 


In this section, we first detail how this non-coherent optical system can be 
used to operate on bipolar and complex-valued data [5,6]. Since the LED outputs at 
P-j^, the transmittance of the mask elements at P 2 and the photodetector outputs at P 3 
are all real and positive, and since the dynamic range of the mask at P 2 is finite, 
various scaling, biasing and data encoding techniques are required to process vectors 
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and matrices with bipolar and comp lex- valued data on the system. To handle complex- 
valued data, we partition the matrix H into its real and imaginary parts H r and 
respectively, as 


H = 


H -H. 

— r — 1 


H, 

— x 


H 

— r 


(4) 


where H and H. are bipolar. This requires a P 2 mask with four times the space-band- 
width product of H, an input LED array with twice the number of elements in x and a 
linear detector array with twice the number of elements present in y_. 

To handle bipolar data, we decompose the input vector x into its positive and 
negative components. Let _a + and a~“ denote the positive and negative components of 
the input vector a. used in the actual system. The M elements a-^ and a 2 m of and 
a” are generated according to 


(5) 


where the x m are the elements of the bipolar physical vector x* 

The system is then operated twice, first with aj_ as the input and then with a _2 
as the input (with the same physical mask B_ used for all cycles) . The outputs from 
the system on successive cycles are and 13 a. 2 • These outputs are: (1) subtracted; 

(2) scaled by (h - h) , where h and h are the maximum and minimum values of the ele- 
ments of the matrix H; and (3) biased by ^ ^ . The system’s output y_ after two 

successive cycles is thus — m=l m 




To insure that the transmittance of the physical mask B_ has a transmittance for each 
element satisfying 0 _< b mn < 1, we scale and bias B_ such that 



where h^ denotes the values of the elements of H and b mn denotes the transmittances 
of the physical mask B_ placed at P 2 . 

The second issue to be detailed in this section is how rapid convergence of the 
iterative algorithm is insured. This is achieved by modifying our original iterative 
algorithm to include an acceleration parameter (x) (as shown in Figure 1) . The new 
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system thus implements the iterative algorithm [4-6] 


x( j + 1) = x(j ) + “tz - H x ( j ) ] . 


( 8 ) 


We select 0) by one of the following criteria to insure rapid convergence of the algo- 
rithm. We can select 


to = -1/X 


max’ 


(9) 


where X is the maximum eigenvalue* in absolute value, of the matrix H, or we can 
max . — * 

obtain an approximation to (.9) by [10] 


X < 
max — 


| H | 


M N 

£ Eh 

— i mn 


1/2 


( 10 ) 


where | |h| | is the Euclidean norm of the (N x N) matrix H. The U) criterion resulting 
from the upper-bound in (10) can be modified by selecting 0) = ”KM max > where K > 1 is 
a constant selected empirically from analysis of a specific problem. (In several spe- 
cific case studies that we have considered, K - 2-3 was found to be adequate.) The 
microprocessor feedback system (Figure 2) performs the necessary scaling, biasing and 
preprocessing described by (5) and (7), the detector post-processing in (6), and the 
acceleration parameter selection noted in (10). In Section 6, we describe how the 
I0P itself can be used to calculate the acceleration parameter w in (9). 


4. SOLUTION OF SIMULTANEOUS LINEAR EQUATIONS 


As our first general I0P application, we consider the use of the system of Fig- 
ure 1 for the solution of simultaneous linear (difference, differential or algebraic) 
equations. The general iterative algorithm for the solution of linear difference 
equations is 


x(j + 1) = $X (j) + f_ , (11) 

where j is the discrete-time index, _f is the forcing function equal to a constant vec- 
tor (or, more generally, a vector of time- functions) , x(j) is the state of the system, 
and ^ is the open-loop system matrix. 

The first application of the TOP of Figure 1 to the problem in (11) is as a dy- 
namic system simulator. In this case, we model the physical system by the state-space 
differential equation model 


dx 

— = Ax + b , (12) 

dt 


where A is the open-loop system matrix and the vector b is a constant or function of 
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time. We then utilize a numerical analysis algorithm to discretize this system model. 
We illustrate the approach by the f orward-Euler approximation [11] 


dx(t) 


dt 


t=jT 


x(j + 1) - x(j) 

T 


(13) 


where T is the constant time-increment (or step-size) between discrete samples de- 
scribed by the index j. Substituting (13) into (12), we can simulate the physical 
system modeled by (12) on the IOP of Figure 1 by the linear iterative algorithm 


x(j + 1) = [I + T A] x( j ) + Tb . (14) 

By analogy with (11), [I + T A] - and T b = f_ , Other numerical analysis algorithms, 
such as the trapezoidal rule, are possible. The most preferable ones appear to be 
the Runge-Kutta and predictor-corrector algorithms [12], 

The second application of (11) on the IOP is the iterative solution of linear 
algebraic equations. In this case, the iterative system algorithm of (8) is used. 

When rearranged in the form in (1) , we can identify the matrix and the f_ vector in 
(11) as [I_ - 0 )H] - j? and (qy - f_. The algorithms in (8) and (1) have been used [2,13] 
to calculate the set of adaptive weights necessary in adaptive phased-array radar sig- 
nal processing. While space does not allow us to elaborate on this application, we 
note that, in this case, the matrix H in (3) is the covariance matrix M of the far field 
input to the antenna, y is the steering vector s_ for the antenna and the unknown vec- 
tor x to be determined is the set of adaptive weights w to be applied to the received 
signals to steer the antenna in the desired direction (defined by s) and to null the 
noise field (defined in frequency, velocity, angle and range by M) . 

Four iterative algorithms to solve (3) for x given by (2) can directly be iden- 
tified. The first is the Richardson algorithm [14] of (1) implemented with the accel- 
eration parameter to as in (8) . The remaining three algorithms can be described in a 
new and quite useful formulation by decomposing the matrix H. into the sum of a diago- 
nal matrix I) and lower and upper triangular matrices _L and IJ as 

H = D - L - U . (15) 


In terms of the ID, Ij and U, we can write the remaining three algorithms as [15]: 


Jacobi: 
Gauss-Seidel : 
Overrelaxation: x(j + 


x(j + 1) = 
x(j + 1) = 

1) = j(D - 


|g hi + u] x(j) + D \ 

[CD - L)" 1 ^ x(j) + (D - L) _1 _ 


ooL) 1 [ (1 - «)D + wU] 


x(j) + 0 )(D - uL) 


(16a) 

(16b) 


3L« 

(16c) 


The choice of one of the four algorithms in (8) or (16) depends on many factors that 
are highly application and problem dependent (e.g., convergence of the algorithm, dy- 
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dynamic range of the mask, number of iterations required and the need for an adaptive 
mask) . 


5. LEAST-SQUARES PROBLEMS 


The solution to many physical and statistical problems can be formulated in the 
general context of a least mean-squares problem. A model is postulated to approximate 
measured data and the parameters of the model are selected to minimize the mean-square 
error between the measured data and the model. Curve-fitting and linear and poly- 
nomial regression are classical examples of least mean-squares problems. We formulate 
and address below the solution on our IOP of this class of important problems. 

We begin by reformulating our simultaneous linear algebraic equation problem 
solution in Section 4 as a least-squares problem, in the context of Figure 3, the 
input to the operator H is x and noise or an error source £ is present at the output such 
that the observable output y_ differs from * The problem is thus to find the x 
that minimizes the square of the difference 

J = | |y - H x| | 2 (17) 


between y and H jc . 


Noise or 
Error Source e 


Y w 

H 

1 

X IP 

— i 
H x 



FIGURE 3 Schematic diagram of 

a least-i 


X 


Upon expanding (17), we obtain 


J = - ltd [y - Hx] = y T y - y T H.x - x T H T y + xVhx 


(18) 


To minimize (18) , we set the partial derivative or gradient of the scalar per- 
formance index in (17) with respect to the unknown vector x equal to zero. The re- 
sultant column vector is then the solution of 
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or 


(19) 



In least-squares data processing, y is an M-vector, H is an (M x N) matrix and x is 
an N-vector, where M >_ N. We consider first the case when the number of unknowns N 
equals the number of equations M. In this case, H is a square matrix and the solution 
to (19) becomes 


x = 


T 

HE 


T -1 -T T -1 

H y « H H H y = H y 


( 20 ) 


where H is assumed by implication to be a non-singular matrix and therefore to be in- 
vertible. This occurs when the y = Hx problem being solved is well-formulated as 
occurs in all practical engineering problems. Thus, in this case, the solution x = 
H-iy minimizes the quadratic performance index in (17). For this case, we solve the 
least mean-square problem by our Richardson algorithm of (8) . 

A more interesting problem (corresponding to the practical case of curve-fitting) 
occurs when there are more equations M than unknowns N (i.e., when M > N) . In this 
case, H is non-square and therefore non- invertible . We thus solve (19) by the new 
iterative algorithm 
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T 

This is equivalent to pre-multip lying (3) by H and applying our iterative Richardson 
algorithm in (8) to the solution of [H T H]x = H^y rather than Hx = y_ . 

The least-squares fitting of experimental observations leads to a more general 
iterative vector-matrix solution. Suppose that we have L data points jd = {p^} and 
wish to approximate the set of observations z(p), in the least-squares sense, by the 
finite linear combination 


z(p) 


- £.h(p) 


( 22 ) 


of basis functions ^ = {(f) n } with the associated weighting coefficients _c = {c^}. In 
(22), each of the N basis functions are evaluated at the L data points {p^} and the 
relationship in (22) is approximate if there are more data points L than basis func- 
tions N. The errors or residuals [z (p) - cA^(p) ] of the curve-fit are the differences 
between the observed data z(p) and the approximation cT$(p) . According to the princi- 
ple of least-squares , we select the weighting coefficients in (22) to minimize the 
sum-of-squares of the residuals . To find the coefficients c_, therefore, we minimize 
the mean-square difference 


J = 


Data L 

w 


z(p l> ■ 




(23) 
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We proceed as before. Upon setting 3j/9_c = _0, we find 


z 

Data 

T 

4KP£>i (P£ 

c = 

Data 4<P * )Z<P * ) 

{p d 

— 


V 


H 

X = 



( 24 ) 


which is again of the general form Hx - y, with the summation over the {p^} data 
points incorporated into the matrix and vectors in the algorithm in (21). When the 
basis functions = {cj) n } are specified, the matrix H can be precomputed and the prob- 
lem solved by the iterative algorithm in (8) or (21) . A simple discrete example of 
arises in the calculation of the best straight-line fit through a set of data 
points. In this case, <j>^ = 1 and ^2 = P anc ^ z (p) ~ C 1 + C 2P* ^ simple continuous 
example case is the Fourier series expansion of z(p) , in which case the {(j) n } are the 
complex exponentials exp (+jnco^p) . 

In this section we have illustrated and formulated the least mean-squares prob- 
lem as the iterative solution of the system of linear algebraic equations Hx = _y on 
the IOP. We recognize that the range of applications of our formulation includes 
curve-fitting, linear and nonlinear regression, state and parameter estimation, orbit 
determiniation and signal processing in control and communication engineering. We 
will address these applications in our future work. 


6. DECONVOLUTION AND EIGENVALUE PROBLEMS 


The need to implement a deconvolution frequently arises • In such applica- 
tions (Figure 4) , the measured output data y_ = {y m } from a system characterized by 
the impulse response {h m } will be a modified version of the original input vector x = 
{x n >; i.e., 


m 


m 

Z 

n=0 


(m-n) 


x . 

n 


(25) 


In (25), we assume that the system is causal, otherwise we appropriately shift the im- 
pulse response so that h m = 0 for m < 0. The solution of (25) for the unknown input 
x is directly described by the vector-matrix equation y = H x_ , where (for the case of 
M - N - 3) the matrix H is shown below : 



(26) 
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FIGURE 4 Simplified schematic diagram of a deconvolution processing problem. 


In the general case of M > N = 3, the size of H grows, but H still has only 
three non-zero diagonals (with equal values Iiq, h^ and respectively* along each 

diagonal), and there are an additional (M - N) elements with zero value added to x. 

The solution of the deconvolution problem in (25) is thus directly realizable on the 
IOP by the iterative algorithm in (8) with the matrix mask in the form shown in (26). 
Once again, we see how different problems can be solved on the same IOP by different 
choices of the matrix mask and the iterative algorithm used. 

Another quite general and useful matrix operation is the calculation of the 
eigenvalues A n and corresponding eigenvectors of the (N x N) matrix H. This opera- 
ation can be performed by the iterative algorithm [16] 


x(j + 1) = H x (j) 


(27) 


which is equivalent to (8) with the exogenous vector y^ set equal to zero. To see how 
(27) allows calculation of the A n and _cj) n , we assume that | A-jJ is the largest eigen- 
value and that the eigenvalues are ordered, in absolute value, according to | A-jJ > 

| A 2 | > •••• > |A N | . By singular value decomposition, we write H and the original 
initialization vector x^(0) as 


and 


N T 

H = Z <j) . A . 6 . 

~ 1=1 -1 


N 


5l <0) - jIj. 


(28) 


After j iterations, j£(j + 1) = H^2£^(0) is obtained. Substituting (28) into this ex- 
pression, we find (for large j) 


x(j + 1) = 


= u3 


EHx^O) 


Large j 




(29) 


We form the component-wise ratio x^(j + l)/xx*(j) and from it find Ax (the largest 
eigenvalue of H) . We divide the denominator of the output vector x by the sum of the 
squares of its elements and thus obtain the principal eigenvector • We then use the 
new initial vector x (0) = x^(0) - a^<j)^ , Repeating the same iterative procedure in 


114 




(27), we then find X 2 and j)_ 2 • By continuing this process, all of the eigenvalues and 
eigenvectors of H can be determined in decreasing order. Modifications to this pro- 
cedure allow us to find the eigenvector whose eigenvalue lies closest to a prescribed 
value [16] . Many other extensions of these techniques and applications of the above 
results are possible and will be the subject of future work. 


7. SOLUTION OF MATRIX EQUATIONS 


Many problems involve matrix-matrix multiplication. There are two ways to real- 
ize a matrix-matrix multiplication on our vector-matrix multiplier. In the case of 
large matrices, the preferable technique is to form the product Y = M • of the 
matrices M and N by feeding sequentially the columns n n of N as successive input vec- 
tors. We thus realize the matrix-matrix product by performing N vector-matrix multi- 
plications; i.e., 


= M • N = M [n ± n 2 




(30) 


where the matrix output Y appears sequentially as N column vectors; i.e., Y = [y-] y 2 , 
• • Xn ] • A second technique is to vectorize the matrix N into an element vector 
whose first N elements are the elements of ni and the next N elements are the elements 

—Jl o o 


of n 


2 > 


9 2 

The matrix M is formatted as an (N x N ) matrix whose diagonal blocks 


are replications of the matrix M; i.e,. 


M * N 



n„ 


n„ 


% 




In 


(31) 


With a matrix-matrix multiplier realized by (30) or (31) , we can thus use the 
TOP to solve matrix-matrix equations such as 


HX = Y 


(32) 


by vectorizing the matrix Y or by operating the system on sequential cycles with the 
column vectors of Y as successive inputs. A particularly useful operation that is 
now possible is matrix inversion. In this case, we solve (32) using (8), but with 
Y = I_ (the identity matrix) . A final useful matrix equation that frequently arises 
is the solution for the embedded matrix X in an equation of the form 


A X B = Y 


(33) 


In this case, we write the matrices and X as the column vectors CjYj and CjX]> whose 
elements are the lexigraphically-ordered elements of the matrices Y and X, respective- 
ly. We solve (33) for X by writing (33) in the form 

[A © 1 T ]C[X] = C[Y] , (34) 


115 



where 

A, 



denotes the outer or Kronecker product. 


For the case of the (2 x 2) matrix 


A 






(35) 


T 

Solution of the Lyapunov matrix equation X A + A X = Y is now possible on the vector- 
matrix IOP. 


8. SOLUTION OF NONLINEAR MATRIX EQUATIONS 


In Section 7, we described how we have broadened the repertoire of operations 
achievable on the IOP to include the full-class of linear and embedded matrix equa- 
tions. As our final general iterative vector matrix operation, we consider its use 
in the solution of nonlinear matrix equations. The need for the solution of such 
equations arose in our original use of the system for adaptive phased-array radar 
processing [2] and in our present optimal control linear-quadratic-regulator applica- 
tion [4] . 

The general problem of two quadratic equations in the two unknowns p and q can 
be written as 


2 2 

f (p , q ) = a p + b pq + c q +dp+eq+r =0 
n n n n n n n 


(36) 


for n = 1 and 2. We rewrite this pair of two nonlinear equations in vector form as 

f [x] - 0 (37) 

T T 

where _f[x] = [fp(x) f 2 (ii)] and 21 = [p <?] • We then solve (37) by the Newton-Raphson 
iterative algorithm 

-1 

f[x(j)]. (38) 

2L(3 ) 


x(j + 1) = x(j) - 


3f 


3x 


The solution of (38) requires two iterative loops. The Jacobian matrix _J = [3f_/3_x] 
is stored algebraically. At each iteration, 2f is evaluated numerically with dedicated 
electronics at the last iterate 2l(j)* The inverse matrix [_J [x_( j ) ] ] — is evaluated on 
the optical vector-matrix system in an inner iterative loop. The new x.(j + 1) iterate 
is then evaluated optically in an outer iterative loop. 

Three immediate applications for such a nested two-loop iterative algorithm 
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arise. The first occurs in the implementation of the overrelaxation linear algebraic 
equation solution of (16c) with [_D - 0)L] on the left-hand side of the equation. In 
this case, a matrix-vector multiplication is required to obtain the external vector 
to be added. The second case arises in the use of (8) to solve for the adaptive 
weights w for a phased-array radar whose noise field is described by the covariance 
matrix M and the direction of its main lobe is defined by the steering vector _s. In 
this case, we solve the vector-matrix equation M • w = s_. When the noise distribution 
varies, the matrix mask M must be updated. 

The IOP application we are presently considering under support from the NASA- 
Lewis Research Center is the use of the IOP in the solution of the 2_inear-qjuadr a tic- 
regulator (LQR) problem of optimal control. In this application, the IOP is used to 
calculate the optimal controls to be applied to an F100 aircraft engine. To determine 
these control signals, we must solve the algebraic Riccati equation and calculate the 
LQR feedback gains. In this application, we use: (1) the vectorization of a matrix; 

(2) the Kronecker product technique; and (3) the nested inner and outer loop system 
to solve the nonlinear algebraic Riccati equation [4]. 


9 . SUMMARY 


In this paper, the general-purpose nature of an iterative vector-matrix pro- 
cessor has been emphasized. This system is capable of solving a wealth of general 
purpose applications. General operations described included: linear difference and 

differential equations, linear algebraic equations, matrix equations, matrix inver- 
sion, nonlinear matrix equations, deconvolution and eigenvalue and eigenvector compu- 
tations. Engineering applications being addressed for these different operations and 
for the IOP are: adaptive phased-array radar, time-dependent system modeling, decon- 

volution and optimal control. 
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INTERFEROMETRIC SURFACE-WAVE ACOUSTO-OPTIC 


TIME- INTEGRATING CORRELATORS 
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Harry Diamond Laboratories, 2800 Powder Mill Road 
Adelphi, Maryland 20783 


ABSTRACT 


A novel structure for a coherent-interferometric acousto-optic (AO) 
time-integrating correlator has been implemented by using a single surface 
acoustic wave (SAW) device with tilted transducers to reduce intermodulation 
terms. The SAW device was fabricated on Y-Z LiNbOg with a center frequency of 
175 MHz, a bandwidth of 60 MHz, and a time aperture of about 10 ys . The 
instantaneous bandwidth of the AO correlator is determined by the spacing 
density of the photodetector array, with a potential of 120 MHz. Typical 
integration times are 30 to 40 ms, providing processing gains in excess of 
10^. Such a device is very useful in providing fast synchronization of 
communication links. In addition, the device can demodulate to base band and 
simultaneously act as a synchronization lock monitor for moderate data rates. 
Where processing may be limited by doppler shifts, a two-dimensional 
architecture has been implemented to allow full processing gain. 


INTRODUCTION 


Current digital and microwave technology has made possible wideband 
communications for space applications. These systems present unique problems 
for which acousto-optics (AO) may provide solutions. The relative ease in 
applying multiple transducers to surface acoustic wave (SAW) delay lines allows 
novel architectures for such signal processing functions as correlation or 
convolution. Where large processing gain is required, integration in time 
rather than space permits time-bandwidth products in excess of 10 . Coherent, 
interferometric schemes provide both time (e.g., time-dif ference-of-arrival) and 
frequency information simultaneously. Two one-dimensional, SAW AO time- 
integrating correlators and a two-dimensional correlator incorporating these 
features have been constructed, and the results are presented. 
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ONE-DIMENSIONAL CORRELATOR ANALYSIS AND APPLICATIONS 


Two-input-beam time-integrating correlator 


Figure 1 illustrates the operation of a two-beam SAW AO correlator « A 
transducer is deposited on each end of the delay line (y-cut, z-propagating 
lithium niobate) at an angle with respect to the perpendicular to the z-axis 
of the Bragg angle in the delay line material^ at the correlator design 

center frequency, w . The relative tilt between the two transducers is 20 fi « 

The correlator input signals, A(t) cos to^t and B(t) cos out, generate counter- 
propagating SAW’s which interact with two sheet beams of laser light generated 
from a single laser and projected into the top side of the delay line with an 
angle of 40 g between them* Here, 0 fi is the Bragg angle in air for the 
frequency uu. Due to the strong angular dependence of the AO Bragg interation, 
the right sheet beam interacts primarily with the SAW launched by the right 
transducer; likewise, the left sheet beam interacts primarily with the SAW 
generated by the left transducer. Cross terms are down by 40 dB • The 
diffracted light is imaged onto an integrating, square-law photodiode array. 

For equal sheet beams of uniform intensity the diffracted light may be described 
by 


and 


L(t,z) = A(t - co s [ w 0 ( t - 


z sin 20 


■) - « A (t -*)] 


L 2 (t,z) 

z sin 
c 


= B(t 

20 b - ) 


+ -§) cos[ W£ (t + 


( 1 ) 


( 2 ) 


Here, to is the light frequency, t is time, z is the distance along the delay 
line and along the photodiode array (center is z = 0) , u is the acoustic 
propagation velocity, and c is the free-space light velocity. The photodiode 
array output current is proportional to the square of the sum 
of L^(t,z) and L 2 (t,z). The significant term, the cross-product proportional 
to L^(t,z) * L 2 (t,z), may be manipulated trigonometrically to generate frequency 
sum and difference terms. Upon time integration, only the difference term 
remains, yielding an output voltage proportional to 


V(t,z) = J t A(t --§) B(t +-§) 


2o) 0 z sin 20TJ 
cos [ — & — 


+ (u> A - Wg)t - 


(3) 


(“A + u^ dt 
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Since sin 20g = (oog/u) / 


(a^/c), and for co^ = this output voltage becomes: 
V(t,z) = COS (coq - W A ) ] 

(4) 


/ T A(t - B(t + -^)dt. 


Thus, the output voltage provides the correlation of the input signals 
(modulation and carrier) about the correlator design center frequency. Figure 2 
shows the autocorrelation of a bi-phase signal with 2,5 MHz bandwidth at 1 MHz 
above ujg • The bias level, produced by the square terms proportional to the 
integral of L^t^z) 2 + 'L At z)^ y depicts the Gaussian intensity profile of the 
sheet beams. This bias level may be easily removed by high pass filtering or by 
a subtraction of one output from a subsequent output produced with one input 
phase-inverted. The position of the fringe pattern is due to the relative time 
delay between the signals. Instantaneous bandwidths of 60 MHz at 175 MHz have 
been achieved with integration times of 30 to 40 ms for time apertures 
of 10 ys. 


Single-input-beam time-integrating correlator 

Since the angle of the incoming beam with respect to the SAW wavefront 
(transducer tilt) equals that for the diffracted beam, an inverse architecture 
using one input laser beam is possible, as shown in Figure 3. The output 
voltage from the photodiode array can be shown to be identical to that for the 
two-beam correlator as described by equation (4). Although intermodulation 
terms (the interaction of the SAW from the left transducer appearing at the 
right output diffracted beam and vice-versa) are suppressed by less than the 40 
dB achieved with the two-beam architecture, suppression is sufficient for most 
applications . 


The large processing gains, linearity, and time delay capabilities of the 
SAW time-integrating correlators make them useful as advanced signal 
processors. These correlators may be employed merely to provide synchronization 
in a communications system as a discrete version of the “sliding correlator" or 
may be used to demodulate the signal as well as to continually monitor 
synchronization. Figure 4 shows the output of the correlator, with bias 
subtracted , for inputs consisting of an information modulated , hybrid bi-phase 
wideband signal and an unmodulated (information free) reference signal. The 
overall bandwidth was 16 MHz with a bi-phase rate of 1.3 million bits per 
second. Integration was sychronized with the information rate. Input signal- 
to-noise ratio was -30 dB. 
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Two-Dimens ion al Correlator 


A two-dimensional, four product correlator has been demonstrated which 
features the interference between two doubly diffracted beams from perpendicular 
SAW delay lines. As shown in Figure 5, the initial source of the sheet beams is 
a single laser. Since the SAW devices are perpendicular, interaction in only 
one device is between light polarized perpendicular to the direction of travel 
of the SAW’s in that device. This is the preferred polarization; for light 
polarized parallel to the direction of SAW propagation, the diffraction 
efficiency may be somewhat reduced and a broadening of the Bragg angle 
dependence is experienced. 

The double diffraction effect"* is diagrammed in Figure 6 for the 
horizontal SAW device. The light from the first diffraction is of the form: 


z sin 

h Hi (t ,z) m A(t - -) cos[ W£ (t + - ) 


(5) 

+ W A (t “ • 

The doubly diffracted light is then of the form: 

L H 2 (t,z) = A(t - -J) B(t + -J) cos[(o £ (t + 

z sin „ „ (6) 

c + W A (t ~ u } + “B (t + u )] * 

Since 2(0g - © B2 )» then sin ^ « (w /u)/(u> A /c) - (u 2 /u)/(a)^/c) , 
and equation (6) reduces to : 


L H2 (t,z) = A(t - ^) B(t + -|) 


(7) 

COs[(u)£ + w A + Wg) t + £ A h ] , 


where & H = ( {to^ - w } - {co A - u>g}). Here, and w 2 are design center 
frequencies for which the Bragg angles are and 0^^ , respectively. 

m i and co^ are selected to be different so that the doubly diffracted light may 
be separated from the undiffracted beam. Similarly, for the vertical device, 
the doubly diffracted light is 
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Ly 2 ( t ,y) = C(t - -J) D(t + 

COS [ («£ + Wq + 03|)) t + Ay] , 


( 8 ) 



where Ay = ( {oa - u^} - {oo c - Those doubly diffracted beams are imaged 

onto a photodiode area array or a vidicon that integrates the output current 
which is proportional to the square of the sum of these beams * The frequency 
difference term derived from the cross product, which is proportional 
to Lrr (t,z) * L v (t,y), produces an output voltage 

V&,z,y) = f T A(t - A) B(t + A) C(t - A) D(t + A) 


COS [ ( 0J A + Wg- ft) c - (UjjJt + A h - A A v )dt, 


(9) 


If (co A + Wg) = (wq + Wg) » then equation (9) reduces to 


V(t,z,y) = cos [A A r - A A y ] f T A(t - A) 


B(t + A) C (t - A) D (t - A) dt. 


(10) 


For (o) i -o) 2 ) = (u) A -w B ) and (a) 3 ~io 4 ) = (co G — tu D ) , this further reduces to: 


J T A(t - i) B(t + £) C(t 


D(t +-J)dt. 




(11) 


Thus, this device provides a two-dimensional, four-product correlation. It has 
the additional ability to measure some degree of deviation from design 
frequencies. The usefulness of this correlator for processing signals 
containing doppler shifts in frequency can be shown by replacing the generalized 
input signals to the vertical SAW delay line, C(t) cos u^t and D(t) cos oj^t , by 
linear FM chirps, cos (u)q + at)t and cos (u^ - at)t. Equation (8) becomes: 


l'V2 < ‘ t,y ' ) = cos[(tO£ + u> c + tOjj) t + A a h 

- 4a <A)tJ . 


( 12 ) 


The output voltage then becomes 


V(t,z,y) = f T A(t - A) B(t + A) C os[o) A + 


0) B 0) C UJ D )t + y 

A A v + 4a(A)t]dt. 


(13) 
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If w = w* +0) where w is an unknown doppler shift and to A + 0)’ ® + w_ t 

then B equa?ion (?5) reduces DP to A B C D 


V(t,z,y) = / T A(t --§) B(t +-J) 
cosKwjjp + 4a -g)t + % A H ” u A vl dt * 


(14) 


It can be seen that there is a y position for which w = -4a ^ and the doppler 
shift is compensated* As an ambiguity function processor, correlation of a bi- 
phase code in the horizontal dimension provides range information, and doppler 
compensation in the vertical dimension provides velocity data. A four-product 
correlator has been implemented, and the output is shown in figure 7, 
where ^ and y are nonzero* The correlation spot has been moved from the 
center of the vidicon output by time delay (simulated range) and carrier 
frequency shift (simulated doppler). Bias subtraction has been performed using 
a video image processor • 


SUMMARY 

One- and two-dimensional interferometric time-integrating, acousto-optic 
correlators have been demonstrated. These offer large processing gains for 
application to wideband signal processing systems. Potential uses for 
synchronization, demodulation, and ambiguity function processing have been 
discussed . 
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Figure 2 . -Correlator output for direct sequence code at 00 


1 Mhz. 



Figure 3. -Single input beam SAW AO time-integrating correlator. 



DIGITAL " 1 ” DIGITAL "0" 


Figure 4.- Output of correlator for hybrid bi*-phase signals showing > 
demodulation. 
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Figure 5 .-Two-dimensional SAW AO time-integrating correlator. 


A (t -i) B (t +|) cos [(^ + to A + a) B )t +lA] 



Figure 6 . -Acousto-optic interactions producing a doubly diffracted beam. 


128 











NONCOHERENT OPTICAL PROCESSING OF RASTER FORMATTED SIGNALS 


* 


William J. Miceli 
Rome Air Development Center 
Hanscom AFB, MA 01731 

and 

William W. Stoner 
Science Appl ications , Inc. 

Bedford, MA 01730 

The inherent 2-D nature of optical imaging systems can be exploited in the 
processing of long duration signals by raster recording and parallel processing 
of complete frames of data. The 2-D Fourier transform of such a frame of data 
(i.e., the "folded spectrum") is most conveniently obtained with a coherent 
optical processor, however, the side effects of optical coherence (speckle, 
parasitic phase noise, linearity in complex amplitude, etc.) inhibit st ch an 
approach in many aerospace applications. These problems may be overcome by 
using noncoherent optical processors which use spatially incoherent light and 
capitalize upon the versatility of optical transfer function (OTF) synthesis. 
Incoherent optical spatial filtering of raster formatted signals is 
accomplished by imaging the raster recording with an optical system whose OTF 
modifies or selects out frequency components by means of a suitably designed 
pupil plane mask. To demonstrate the capability of incoherent optical signal 
processors, we present experimental results of a double cylindrical lens pupil 
mask which generates an OTF suitable for the selection of a single locus line 
in the folded spectrum. This OTF consists of three narrow passband ridges, 
with the center passband containing the dc component and the offset ones 

* Because this paper was not available at the time of publication, only the 
abstract is included. 
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corresponding to the spatial frequency of the locus line of interest. The OTF 
is probed with a rotatable Ronchi ruling which is imaged onto a vidicon camera 
at the output of the system. Unless the spatial frequency and the angular 


orientation 
of the OTF, 


of the Ronchi ruling corrsponds to the frequency offset passbands 
the input is completely filtered out. 
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THREE METHODS FOR PERFORMING HANKEL TRANSFORMS 


R. A. Athale, H. H. Szu and J. N. Lee 
Naval Research Laboratory, Washington, D, C. 20375 


ABSTRACT 

Generalized Hankel transforms are useful in analyzing the effect of circularly 
symmetric optical systems on arbitrary inputs. Some examples of such systems are 
complex laser resonators and space telescopes. Three methods for performing Hankel 
transforms with optical or digital processors are described. The first method is 
applicable when the input data is available in Cartesian (x-y) format and uses the 
close connection between generalized Hankel transform and the two-dimensional 
Fourier transform in Cartesian coordinates. The second method is useful when the 
input data is in polar (r - 0) format and uses change of variables to perform the 
n th orc [ er Hankel transform as a correlation integral. The third method utilizes 
the von Neumann addition theorem for Bessel functions to extract the Hankel 
coefficients from a correlation between the radial part of the input and a Bessel 
function. Initial experimental results obtained for optical implementation of the 
first two methods are presented. 


INTRODUCTION 

The analysis of complex optical systems is greatly facilitated by two- 
dimensional Fourier transform techniques. The effect of an optical system on 
arbitrary inputs is easily described by a transfer function in the Fourier domain. 
Generalized Hankel transforms are similarly useful when dealing with a circularly 
symmetric (or axisymmetric) system for arbitrary inputs. 1 This situation is 
encountered in performing mode analysis on the output of a slightly misaligned 
laser resonator as well as in aligning space telescopes. An optical method for 
performing mode analysis via generalized Hankel transform will have the unique 
advantage of preserving the phase of the wavefront to be analyzed. 

It is well known that when a two-dimensional function has circular symmetry 
(i.e. it depends only on the radial variable, r) , its Fourier transform is also 
circularly symmetric (i.e. it depends only on the radial variable, p) . It can be 
shown that in such a case the Fourier transform is equivalent to the 0th order 
Hanke 1 transform. 2 


If 


Then 


f(x,y) = f 


or 


& 


(iv) 


where { ) indicates two-dimensional Fourier transformation. 
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J 2 2 J~ 

lx +y , p = >u 


2.2 

+v 


& 



F (p) = 2 tt 
o 



o 


rdr f(r) J (p) 
o 


( 1 ) 


where F (p) is the 0 order Hankel transform and J Q (pr) is the 0 ^ order Bessel 
function of the first kind. Thus in dealing with circularly symmetric systems, the 
0 th order Hankel transform (which is a one-dimensional operation) can be used 
instead of the two-dimensional Fourier transform if the inputs also are circularly 
symmetric. For arbitrary inputs, this technique can be extended by using the genera- 
lized Hankel transform, which expands on the n th order transform (F„ (p) with kernel 
j n (p) in Eq. (1)). n 

The generalized Hankel transform, F nn (p), can be defined for an arbitrary 
function, f(r,0), as follows: 


f(r,e) = ^ ^ f Q (r) e 2n0 

n=-°° 


F 

nn 


(P) 



rdr f (r) J (pr) 
n n 


( 2 ) 


The generalized Hankel transform thus involves a Fourier series expansion in 0 , 
followed by an n order Hankel transform (with n ^ order Bessel function, J n (pr), 
as the transformation kernel) on the n^ coefficient of expansion, f n (r). This 
generalized Hankel transform is useful in analyzing systems with circular symmetry 
but when the input is not circularly symmetric. 

In the following sections we will describe three techniques for easy 
implementation of the generalized Hankel transform with optical or digital 
processors . These three techniques are each applicable in different circum- 
stances. Initial experimental results on the optical implementation of the 
first two methods will be presented. In conclusion, we will detail the course 
of future work in this area. 

OPTICAL IMPLEMENTATION OF GENERALIZED HANKEL TRANSFORM 

In optical processors a two-dimensional Fourier transform with respect to the 
Cartesian coordinates (x,y) is performed very easily with the help of a simple 
spherical lens. 2 The equivalence of two-dimensional Fourier transform and 0th 
order Hankel transform for circularly symmetric functions was established in the 
Introduction. A similar connection exists between two-dimensional Fourier trans- 
form with respect to Cartesian coordinates (x,y) and n^h order Hankel transform 
of the radial part if the function is of a* form f(r) e3 n0 . 
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(3) 


^jf(r)<>j - 2„F n (p) 


where 

2 2 ^ -1 
p = (u +v ) , <j> = tan (v/u) . 


and by definition 


F n (p> 


/ 


rdr f(r) 


J^(pr) . 


This connection arises out of the integral representation of Jn (pr) . 1 Using this 
result in the definition of generalized Hankel transform given in Eq. (2) the 
following relation is obtained between the two-dimensional Fourier transform and 
the generalized Hankel transform: 


00 



t tl 

Thus the generalized Hankel transform F nn (p) is equivalent to the n 
coefficient of Fourier series expansion of J^~(p,<j>) in variable <|> . Using 
this result, an optical system shown in Fig. 1 is designed to perform 
generalized Hankel transform on an arbitrary input that is available in 
Cartesian format (f(x,y)). The spherical lens performs a two-dimensional 
Fourier transform on f(x,y) generating (u y v ) , A suitably designed computer 
generated hologram then performs the Cartesian-to-polar (u,v->p ,<j>) coordinate 
transformation on^(u,v) generating , <j>) . 3 This is followed by a cylindri- 

cal lens which takes a one-dimensional Fourier transform with respect to variable 
cj> generating the desired output, F (p). 

Initial optical experiments established the connection between two- 
dimensional Fourier transform and n ^ order Hankel transform for a function 
of the form f(r)ed n ^. The schematic diagram of the optical system is shown 
in Fig. 2. A computer generated hologram was used to encode eJ n ® dependence 
of the input. The output was detected by a TV camera, which measures the light 
intensity in the Fourier plane of the input. The input used in these experi- 
ments had an r dependence given by 6(r-a), thus corresponding to a thin ring 
of radius "a" in the Cartesian (x,y) plane. The n t ^ L order Hankel transform of 
<5(r-a) is aJ n (ap), making the output easily understandable. Figure 3 shows the 
results corresponding to order Hankel transform (i.e. eJ n ® = 1). In Fig. 3(b) 
the function | J Q (ap) | ^ is plotted, which is then compared to a line scan through 
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the origin (Fig. 3(c)) of the output shown in Fig. 3(a). 


Very good qualitative 
Figures 


agreement between the theoretical and experimental results is obtained 
4(a-c) present the results for 1.^ order Hankel transform for the same f(r), 
but here the 0 dependence is e^ . Again good qualitative agreement is 
seen between theory and experiment. A line scan through the origin of the output 
corresponding to 2 nc ^ order Hankel transform of <5(r-a) (with 0 dependence ej20) is 
shown in Fig. 5. The zero at the origin was broader and the side lobes were seen 
to fall slower indicating that we indeed have |j (ap)p as expected. 

OPTICAL IMPLEMENTATION OF N TH ORDER HANKEL TRANSFORM 


The previous method is applicable when the input is available in Cartesian 
(x,y) format, since it involves performing two-dimensional Fourier transforms 
with respect to the Cartesian variables. If the input is polar (r,0) formatted, 
a more direct approach outlined in Eq. (3) has to be followed in obtaining 
generalized Hankel transform. The first part of the operation, which involves a 
Fourier series expansion in variable 0 , Is easily performed optically using a 
cylindrical lens. The calculation of n t ^ a order Hankel transform of the 
coefficient of expansion is less straightforward since it corresponds to a 
space-variant operation. So the main aim of the next two methods is to perform 
the space-variant operation of n^ order Hankel transform optically. 

a. Method Using Change of Variables 

One standard procedure used in converting a space-variant operation into a 
shift-invariant operation is to employ appropriate change of variables. In the 
case of order Hankel transforms the following procedure was described by 
Siegman for implementing the space-variant operation as a correlation integral 
on a digital processor. 4 From the definition 


using 


oo 

/ 


F (p) - # rdr f(r) J (pr) 
n / n 


ax oy 

r = r e , p = p e y 
o 5 o 


CO 

/ 


F n (y) = / f(x) J n (x+y) dx 


(5) 


where F n (y) = pF n (p), f (x) - rf(r), and J^xrfy) = arp J n (rp). The algorithm, 
therefore, consists of first linearly weighting the input f(r) and perform- 
ing r-^x ( logarithmic related) coordinate transform. This distorted input 
is then correlated with a similarly weighted and coordinate-transformed n^h 
order Bessel function to give the desired Hankel transform as well as linearly 
weighted and coordinate distorted forms. In any physical system the correla- 
tion integral will be performed over a finite interval, giving rise to 
truncation errors. Also if the input is sampled in the x-domain, the sampling 
rate should be adequate to represent the function accurately in x-domain. 
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These factors and others are discussed at length, especially for digital 
implementation, in Refs. 4 and 5. 

Since the operations of coordinate transformation and correlation can be 
performed by an optical processor, the optical system outlined in Rig. 6 can 
calculate n^ order Hankel transforms. Computer generated holograms are used 
to perform r-»x coordinate transformation as well as to encode the Fourier plane 
filter with impulse response j n (x) . The second dimension of the optical processor 
can be used to perform different order Hankel transforms on different inputs, thus 
achieving multichannel operation. 

In the initial experiment, both the input and the Fourier plane filter were 
encoded by computer generated holograms. The linear weighting and the coordinate 
transformation of the input was performed by the digital computer before doing the 
holographic encoding. The computer generated holograms used the Lee-Burckhardt 
technique 6 and contained 128 pixels. The optical system is depicted in Fig. 7. 

This system was then used to perform 0^ order Hankel transform on two different 
inputs, fq(r) = PiJ 0 (Pl r ) a ^d f 2 (^) = P2^o(P2 r )* result of 0 t ^ 1 order Hankel 

transform on fq(r) and f 2 ( r ) should be 6 (p-p^) and 6 (p-p 2 ) respectively. The 
results of the computer simulation of this algorithm are depicted in Fig. 8 . The 
finite width of the peak and the sidelobes are due to the finite limits of intergra- 
tion. To facilitate easy comparison with the experimental results, the 0 t ^ L order 
Hankel transform with linear weighting and coordinate distortion (i.e. |F Q (y)|2) 

was plotted versus y instead of F (p) versus p. Figure 9 shows the experimental 
results obtained. The optical output was detected by a 1024-element Reticon 
linear photodiode array. The shift in the peak position and the difference in the 
peak heights (due to the linear weight) are evident, indicating good qualitative 
agreement with the computer simulation results. 

b. Method Using Neumann Addition Theorem 

This method investigates an approach based on the special properties of Bessel 
functions . 7 If the input f(r) is correlated with J m (pr) then the m^ order Hankel 
coefficient F m (p) is obtained at the origin of the correlation plane for the 

particular value of p encoded in the Bessel function kernel. 


F (p) 
m 



rdr f(r) 


J (p (r+r 1 ) ) 
m 


r T =0 


The Neumann addition theorem for Bessel functions states that 


( 6 ) 


J (p (r+r 1 ) ) = J (pr T ) J (pr) (7) 

m m-n n 

n=-°° 

Substituting for J (p(r+r ! )) from Eq. (7) into Eq. ( 6 ) we get 
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( 8 ) 


/ 


rdr f(r) J (p(r+r')) 
m 


00 



rdr 


f(r) 


oo 




(pr') F n (p). 


(pr') J n (pr) 


Thus it is seen that the correlation plane contains an infinite sum of Bessel 
functions of different order weighted by Hankel transform coefficients of 
different order. So in principle it is possible to extract Hankel transform 
coefficients of different order out of a single one-dimensional correlation 
operation. 

The other dimension of optical system can be used to perform correlations 
with J m (pr) with different values of p to obtain complete Hankel transform. This 
approach is currently being evaluated further to determine the situations in which 
this method will be suitable. 


FUTURE WORK 

In this paper we have presented brief outlines of three different approaches 
for performing generalized Hankel transforms by optical or digital processors. 

We also presented initial experimental results on two of the three approaches. 

The future work will concentrate on carrying out the coordinate transformation 
required by the first two methods optically via computer generated holograms. A 
more quantitative analysis of the performance of the optical processor will be 
carried out. These three methods will be compared with each other with respect 
to space-bandwidth requirements as well as pre- and post-processing requirements. 
Finally, generalized Hankel transforms will be applied to specific problems, such 
as analysis of complex laser resonators, and optical systems will be developed for 
those problems. 
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INPUT FOURIER 

f(x,y) PLANE Hp'V F nn^ 

T(u.y) 


Figure 1. - Schematic diagram of optical processor for performing generalized 
Hankel transform via a two-dimensional Fourier transform with respect to 
Cartesian coordinates (x,y) . 




T 2 {f(r( e'" 9 } = F n (p) e-<"+ 

Figure 2. - Schematic diagram of experimental setup for establishing the 

th 

relation between n order Hankel transform of f(r) and two-dimensional 
Fourier transform of f(r) e^ n ^ with respect to Cartesian coordinates ( x >y)« 
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(a) Photograph of the output. 



the theoretically expected result. 



(c) Line scan through the origin of the 
pattern in figure 3 (a) , giving 

n 

If (p) I versus p. 
o 


Figure 3. 


Experimental 


results for 0 

F (p) of 6 (r-a) 
o 


order Hankel transform 
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Figure 5. - Line scan through the origin of the two-dimensional 


Fourier transform of <$(r - a) e 
6 (r - a) . 
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Figure 6, - Schematic diagram of an optical processor for performing 
order Hankel transform on f(r) employing change of variables. 




ure 7. - Schematic diagram of the experimental 

one-dimensional correlation between linearly w 

transformed input, f(X), and similarly weighte 

function J (X) . 

o 


erformxng 
coordinat 
ormed Bes 



gure 8. - Results of computer simulation of 0 
of p^J^Cpjr) and p^ J Q (p 2 r). The linearly we 

transformed Hankel coefficients, If (v)! 2 * ar 
Po = 2p- . 








(b) Input f 2 (r) = p 2 J Q (p 2 r). 


Figure 9. - Oscilloscope traces of output of the optical processor 

performing the 0 th order Hankel transform. Traces correspond 
. - 9 

to |F (y) j versus y for the two inputs. 
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OPTICAL RECOGNITION OF STATISTICAL PATTERNS 


Sing H. Lee 

Electrical Engineering & Computer Sciences 
University of California, San Diego 92093 


SUMMARY 

This paper describes the optical implementation of the Fukunaga-Koontz transform 
(FKT) and the Least-Squares Linear Mapping Technique (LSLMT) . The FKT is a linear 
transformation which performs image feature extraction for a two-class image class- 
ification problem. It has the property that the most important basis functions for 
representing one class of image data (in a least-squares sense) are also the least 
important for representing a second image class. The LSLMT is useful for performing 
a transform from large dimensional feature space to small dimensional decision space 
for separating multiple image classes by maximizing the interclass differences while 
minimizing the intraclass variations. The FKT and the LSLMT were optically imple- 
mented by utilizing a coded phase optical processor. Good experimental results were 
obtained, and they were compared with the performance of the matched filter and the 
average filter. 


I . INTRODUCTION 

Optical matched filtering, which is invariant to the translation of input, has 
been the main basis for optical pattern recognition for many years (refs. 1-3). Op- 
tical Mellin transform was later introduced to obtain scale invariant correlation 
(ref. 4). Currently, the subject of performing statistical pattern recognition and 
classification optically is found to be of considerable interest (refs. 5-9). For 
the two-class problem, we shall show in this paper that the Fukunaga-Koontz trans- 
form (FKT) can be implemented optically (ref. 9). For the K-class problem where K 
is greater than two, the optical implementation of the Least-Squares Linear Mapping 
Technique (LSLMT) will be presented (ref. 10). 

Figure 1 shows the hybrid system for optical implementation of FKT and LSLMT. 

The key element in the hybrid system is the computer generated spatial filter. The 
filter is synthesized from K training sets of images using the FKT or LSLMT algo- 
rithm. When the LSLMT filter is inserted in the filter plane and a test image in the 
input plane of a coherent optical processor (COP) , the output from the COP will be a 
bright spot of light in one of K predetermined locations, provided that the test im- 
age belongs to one of the K image classes in the statistical sense and it is illumin- 
ated with a random phase wavefront. As fast as one can input a new test image 
through a real-time interface device to the COP, a new bright spot of light indica- 
ting classification will appear among the K predetermined locations in the output. 

The COP effectively performs a matrix-vector multiplication in real time, where the 
matrix is (K x N) , the vector is (N x 1) , N(= n x n) is the space-bandwidth product or 
the number of pixels in an image, and K is the number of image classes. 

When the FKT filter (instead of the LSLMT filter) is used, the output from the 
COP will contain a number of light spots whose intensities correspond to the squares 
of FKT coefficients. The FKT coefficients are associated with basis functions 
(features) which possess the interesting property that the most important basis 
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function for one class is also the least important basis function of the other class. 
Image classification can become as simple as comparing two FKT coefficients associ- 
ated with the two basis functions of high separating power. 


II. THE FUKUNAGA-KOONTZ TRANSFORM 

A. Procedures for Calculating the Basis Function 

The procedures for calculating the basis functions of the F-K transform are sum- 
marized as follows. (Details can be found in ref. 9.) 

(a) Given a training set of images {X^^} consisting of image samples from 

class 1 (i = 1; j = 1, . .., M^) and samples from class 2 ( i = 2; j = 1, . .., , 

we first represent each sample image as a column vector of length N, and define 
matrices W^, and as follows: 


W, 


w. 


Y CD y (1 > Y (D 

~1 ”2 —3 


Y (2) Y (2) (2) 

-1 ™2 -3 


4' 


a) 


a) 


4 


( 2 ) 


and 


W. 


y (1) y (D 

-1 -2 


Y CD : (2) (2) 

X : -i -2 

I ♦ 


4 


( 2 ) 


where is an (N x M^) matrix, is (N x M^) and W is (NX + containing 

sample images from both classes. The sample correlation matrix of the whole process 
(computed with data from both classes) is then given by 


W W + = W W + + W W + , 


( 2 ) 


since 


w.w. + 

i x 


M-,- 


j=l 


x. (i > x. (i)+ 

"J 

. th 


i = 1, 2 


is the sample correlation matrix for the i class. Finding the basis functions 
normally involves diagonalizing the (N x N) matrix of (W t W t + ) , which is a very time 

consuming task. But, when the intra-class variation in each class is small, this 
time consuming task can be simplified to the following steps. 

(b) We find the (M^ + M 2 ) eigenvectors E and eigenvalues A of (W^wp, 

(W + W )E - EA. 


( 3 ) 


It can be seen from multiplying Eq. (3) from -the left on both sides by and re- 

grouping that diagonalizing (W^W^) gives (W^E) and A as the eigenvectors and 
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eigenvalues 


(W t W+) (W fc E) = (W t E) A 


(4) 


Therefore, the time consuming task of diagonalizing (W^W^) has become the much 
easier problem of diagonalizing the [ (M^ + M 2 ) x (M^ + M 2 )] matrix (W + W ) and calcula- 
ting the product (WE) . 

(c) By choosing the eigenvectors with the largest eigenvalues, we construct 
matrices and of dimensions [(M^ + M 2 ) X M^] and [M^ X M^], respectively. 

(d) The eigenvectors T, 0 and eigenvalues 1, M of the transformed single class 
correlation matrices are given by 


Fa 1 e 4 w + + (w 1 w 1 + )w e a 1 U = w 

L c c t ll tccj 

A -1 E + wT(W 0 W 0 + )W 4 E A _1 0 = GM 
Lc c t 2 2 t c c J 


(5a) 

(5b) 


It is shown in reference 9 that the eigenvectors of equations (5a) and (5b) are 
identical, 

0 = T (6a) 


and that the eigenvalues of equations (5a) and (5b) are related by 

M = 1 - T (6b) 

Thus, the eigenvectors with large eigenvalues applied to one class will have small 
eigenvalues when applied to the other class. 

H- -1 + + 

(e) The basis functions for the F-K transform are (ip. A E IV ) , where \p, 

^~± c c t -^i 

are the eigenvectors in ¥. 


B. Optical Implementation of the F-K Transform 

Since the F-K transform is a linear transformation, the coefficient correspond- 
ing to a specific basis function is found by calculating the inner product between 
an input image function and the complex conjugate of the basis function. A coherent 
optical processor can be designed to calculate these coefficients in parallel by 
multiplying the input image by a coded phase function exp [i(j> (x,y) ] , and designing a 

filter whose impulse response h*(-x,-y) consists of the complex conjugate of a sum- 
mation of shifted products of the coded phase function and a particular basis func- 
tion 


M 


h*( 


-x,-y) = f *( X + P A > y + qA)exp [-icj> (x + pA, y + qA)] 


pq 

th 


pq 


(7) 


where f (x,y) is the p,q basis function generated from displaying the eigenvector 
PT 
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(< A c lE c lw t +) a two ~dimensional format, cj) r (x,y) is a random function uniformly 

distributed from 0 to 2 , and A is a shift constant (see refs. 11-12 for more details 
of the coded-phase optical processor) . Since the coded-phase function has an auto- 
correlation of a delta function, a coherent optical correlator produces an output 
consisting of the transform coefficients space at intervals of A 


C(x T , y f ) = / / g(x,y) exp [i<f> (x,y) ] ^ f *(x- x ? +pA, y - y f + qA) 


exp [—<j> (x - x ! + pA, y - y f + qA) ]dxdy 


g(x,y)f *(x,y)dxdy 6 (x T - pA, y f - qA) 


Both the coded-phase distribution which illuminates the input and the spatial filter 
can be obtained by computer generated holograms. 

Experimentally the optical Fukunaga-Koontz transformation was applied to the 
problem of distinguishing birds from fish. Ten images of song birds were input to 
the computer through a T.V. digitizer system. These images formed the training set 
for class 1. Ten images of fish were also input to the computer to form the training 
set for class 2. These two sets are shown in figure 2. 

The F-K basis functions are shown in figure 3(a) along with the test images con- 
sisting of five new birds and five new fish in figure 3(b) . Since the basis func- 
tions can contain negative values, the pictures have been scaled so that black equals 
the most negative, and white the most positive. The grey level which corresponds to 
a value of zero is shown in the small square below each basis function. The eigen- 
values corresponding to the ten basis functions are given in table 1. We see that 
the best basis function with bird- type features is number 8. The best basis function 
with fish- type features is number 3. 

It is interesting to compare the third and eighth basis functions with the arith- 
metic average of each training set. Figure 4 shows that the most important F-K basis 
function for a class is similar, but not identical to the average filter for that 
class. This is expected since both means were retained in the training sets. How- 
ever, the F-K basis functions are not all positive. The grey level corresponding to 
zero is shown in the small square under each basis function. 

A filter was generated which contained six of the basis functions shown in 
figure 3(a) in phase-coded form. Basis functions numbers 3, 4, and 1 were chosen to 
represent fish-like features, and 8, 9 and 10 to represent bird-like features. This 
filter was placed in the filter plane of an optical correlator, and a computer holo- 
gram of the coded-phase array was placed in front of the input plane so that its re- 
construction illuminated the input. This is illustrated in figure 5. The output 
was detected with a T.V. camera, digitized, and displayed on a T.V. monitor. The 
digital computer can measure these six coefficients, and use them as input data to a 
linear or non-linear classifier for best classification. 

The results of using three of the ten test images in the coded-phase processor 


148 



are shown in figures 6(a)-(c). The lower right-hand corner of the T.V. monitor con- 
tains a sampled version of the actual light field detected by the T.V. camera at the 
output of the coded-phase processor. It consists of six points of light, where the 
square root of the intensity of each point corresponds to the absolute value of the 
coefficient for a specific basis function. The computer has added the lines around 
these points and the corresponding basis function numbers to help the display to be 
more meaningful. The measured values of the coefficients are plotted by the computer 
in the lower left-hand part of the screen. The input image has been reproduced in 
the upper left. 

A simple linear classifier based on the coefficients from basis functions 3 and 
8 was performed by the digital computer. This is drawn by the computer in the upper 
right-hand corner of the screen. The magnitudes of the two coefficients define a 
point in the two-dimensional space which is located by the mark "x" . Points which 
are located below and to the right of the dotted diagonal line are classified as 
birds, whereas points above and to the left of the line are classified as fish. It 
is clear from the figures that in each case, the combination of an F-K transform and 
a simple linear classifier leads to a correct classification of the input image. 
Figure 7 shows the combined result of 30 classifications, based on the magnitude of 
coefficients from basis functions 3 and 8. Twenty of the points are from the train- 
ing set of figure 2 and ten are from the test set of figure 3(b). A linear decision 
surface is able to separate the class "birds” from the class "fish" with no errors. 


III. THE LEAST-SQUARES LINEAR MAPPING TECHNIQUE 

The LSLMT is useful for performing a transform A, which maps an image vector 
from the i^ class in the large N-dimensional feature space as close to one 
specific unit as possible in the K-dimensional decision space such that the over- 
all mean-square error incurred in the mapping is minimized (see figure 8) 


- 


v. + e. . 

-j -i -ij 


(i = 1, 2, ... K; j = 1, 2, ..., M) 


(9) 


where £ is the error vector for the i ^ classifier and j ^ image sample. The algo- 
rithm of LSLMT involves maximizing the interclass differences and minimizing the in- 
traclass variations . 


A. Procedures for Calculating the Basis Functions 


The procedures for calculating the matrix A and the basis functions of the LSLMT 
are summarized as follows! (Details can be found in ref. 10.) 

(a) Given a training set of images } which consists of K image classes 

and M sample images in each class (i = 1, K; j = 1, . . . , M) , we define a (N x KM) 

matrix W as 


W 


-[■ 


y(l) y(l) 
% -2 ’ 
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(1) * (2) v (2) 


.X. 


X„ 






( 10 ) 


(b) We choose the unit vectors in the K-dimensional decision space to be or- 
thonormal, e.g., when K = 3, 
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(c) Knowing the vectors X. and V. from parts (a) and (b) respectively, 

K M ( . )+ 2 1 

calculate V' V. X. 

L—4 -a — J 

J T * -I 


we can 


(d) Knowing the matrix W from part (a) , we can find the eigenvectors ¥ and 
eigenvalues A of W~*"W 


W + W = V FAY + 


( 12 ) 


It is noted that W + W is a (KM x KM) matrix. 

(e) It can be shown that the inverse of the correlation matrix is given by 

K M 


E E VV 11 ' 

-i=i i = i J J 


L i=i j=l 


= W'FA Vw + 


(13) 


LSLMT. 


(f) Using the results of parts (c) and (e) , we can calculate the matrix A for 


r K M 


E E v.x.w 

r-i -i-j 


,x + 


K M 

E E x. (i) x. (i) 
i=i pi ~ 2 ~ 2 


L i=l j = l 

The basis functions are given by the row vectors of matrix A 


-1 


(14) 


B. Optical Implementation of LSLMT 


To employ the coded-phase processor for performing the matrix-vector multiplica- 


tion of equation (9) , we first need to convert the column vector X 


(i) 


corresponding 


to the input image g(x,y) back to its two-dimensional image format of (n x n) pixels 
and the row vectors of the matrix A to the pattern functions f (x,y) . The total 


number of pattern functions is K, i.e., p + q = 1, 2, . . . , K ? because there are only 
K row vectors in the matrix A. The output from the coded-phase processor will then 
be in a two-dimensional form with a spacing A between different light spots whose 
magnitudes of brightness are V., i - 1, 2, . K. The brightest spot indicates 

which class the input image belongs to. 


The key element required in optically implementing LSLMT is a computer genera- 
ted hologram filter whose impulse response contains the pattern functions f de- 


rived from the row vectors of matrix A. The advantages of optical classification of 
statistical patterns are parallel processing and real-time data rates. As an ex- 
ample, to recognize any unknown image of (n x n) pixels which has the same statistical 
properties as those images used in the training sets of K-classes, the amount of 
parallel computations involved is (K x n x n) multiplications and (K x n x n) additions 
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in real-time. When the real-time rate is 1/30 sec, K is 100, n is 500 and each pixel 

9 

has 64 gray levels (or 6 bits) , the computational rate is greater than 9 x 10 bits/ 
sec. 


Experimentally, the LSLMT was applied to design a classifier for hand-written 
letters in a variety of styles. Ten images of each of the hand-written characters m, 
t and a (as shown in figure 9) were input to the computer through a TV camera/digi- 
tizer system forming the training sets of three classes (detail description about our 
micro-computer based video-image analysis system are given in refs. 11-12) . Using 
the thirty image samples for the three classes and following the procedures outlined 
in Section IIIA, we calculated the matrix A for LSLMT. The three row vectors in A 
yield the three pattern functions f as shown in figure 10. Using these pattern 

functions we next generate a computer hologram filter whose impulse response is given 
by equation (7). The computer generated filter is shown in figure 11 and applied to 
the eight test images shown in figure 12. 

Three of the eight results from optical implementation of LSLMT are illustrated 
in figure 13. On the right the outputs from the coded-phase processor are displayed, 
given the test patterns on the left. Each output contains three spots of light in 
pre-determined locations. The square roots of the spot intensities correspond to the 
absolute values of V^ , and . The location of the brightest spot in the output 

plane clearly indicates classification. The measured values of V_^ for all eight test 

patterns are summarily listed in table 2 and plotted in figure 14. Based on table 2 
and figure 14, we can conclude that we have successfully demonstrated the optical 
implementation of LSLMT. 


IV. COMPARISONS AMONG VARIOUS OPTICAL PATTERN RECOGNITION METHODS 

We now turn to compare the LSLMT with other methods of optical pattern recogni- 
tion and classification. The comparisons cover the matched filter, the average fil- 
ter and the Fukunaga-Koontz transform. The first two comparisons were carried out 
digitally. The third comparison was optical. 

A. Comparison with the Matched Filter 

Two digital filters were generated with the impulse responses of the first let- 
ters m or t in the training sets shown in figure 9 (a) , (b) . All the letters of m and 
t in the training sets (fig. 9(a), (b)) and the test set (fig. 15(a)) were used as the 
inputs g(x,y). Figure 15(b) shows the normalized correlations between the inputs and 
the matched filters, which are defined by 

k k 

V^ = <gm^>/<m^m^> 

V 2 = <gt*>/<t 1 t*> (15) 

where <> signifies an inner product. Using the dotted line as the classification 
boundary will obviously yield zero error of classification. However, a more restric- 
tive .classifier defined as the ratio between V^ and V^ is often preferred because it 

is insensitive to scalar multiplication of g(x,y) which may result from changes in 
input illumination. This more restrictive classification boundary passes through 
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the origin, as shown by the solid line in figure 15, and yields 30% error in recog- 
nition. 

This digital matched filtering result is compared with LSLMT using the same set 
of letters in figures 9(a),(b), and 15(a). Figure 16 shows that the LSLMT can pro- 
vide error free classification. 

B. Comparison with an Average Filter 

Digital average filters were calculated by averaging the characters of the 
training sets in figure 9. The digital filters are shown in figure 17, which can be 
compared with the pattern functions f for LSLMT (fig. 10) . While the average fil- 
ters are formed by superposing only the images of the same class, to form pattern 
functions f for LSLMT, it appears that one may superpose not only the images of one 

class but also subtract the superposed images from other classes. 

Using the digital average filters, we obtain the correlations and between 

the averaged characters and the input g(x,y), which can be any image in the training 
(fig. 9(a), (b)) or test (fig. 15(b)) sets. 

h = Z m j >7< l £ m -jl 2> 

j j 

v 2 = <g E t * >/< l E fc jl 2> (16) 

j j 

Figure 18 shows the results of using digital average filters. The error rate is 20%, 
which is a little better than the 30% rate associated with digital matched filters. 
But, both digital average and matched filters yield results inferior to the LSLMT, 
which has a zero error rate. 

C. Comparison with Fukunaga-Koontz Transform 

This comparison was performed optically using the coded-phase optical processor 
and computer generated hologram filters. Two computer hologram filters were gener- 
ated using the same training sets of images of song birds and fish, one based 
on the principles of the FKT described in reference 9 and the other based on the 
LSLMT (ref. 10). The results of optically implementing the LSLMT are shown in 
figure 19. Two of the six test images used as input are shown in the left column. 

The corresponding outputs each containing two spots of light for this two-class 
classification problem are shown in the middle column. The linear classification 
results are shown in the right column. These linear classification results together 
with those for four other test images are shown in figure 20(a), which are to be com- 
pared with the linear classification results based on the FKT as shown in figure 
20(b). It is obvious that both FKT and LSLMT are good classifiers. The FKT has the 
advantage that it can provide many features for each image. These features can be 
used as the basis for a nonlinear classification routine. However, it is useful only 
for two-class problems. The LSLMT provides only K features (one feature for each 
class) , but it can be applied to multiclass problems (K > 2) . 
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V. SUMMARY 


We have shown that the Fukunaga-Koontz transform can be used as a feature ex- 
tractor in a two-class classification application. It was used for classifying birds 
and fish. After the F-K basis functions were calculated, those most useful for 
classification were incorporated into a computer generated hologram. A coherent op- 
tical processor was designed using this computer generated hologram to perform the 
F-K transform in real time. The output of the optical processor, consisting of the 
squared magnitude of the F-K coefficients, was detected by a T.V. camera, digitized, 
and fed into a micro-computer for classification. A simple linear classifier based 
on only two F-K coefficients was able to separate the images into two classes, indi- 
cating that the F-K transform had chosen good features. 

The Least-Squares Linear Mapping Technique has been optically implemented to 
classify large images also by utilizing the phase-coded optical processor and com- 
puter generated hologram filters. The principles of LSLMT are incorporated into the 
computer hologram filters. Although we have demonstrated here only two- and three- 
class problems, the LSLMT is useful for optical classification problems of many 
classes . 

A method was developed which simplified the computation of the FKT or LSLMT 
basis functions for large dimensional imagery, and was found to work well when the 
intraclass variation in each class was small, and the correlation matrix could be 
approximated by a sample correlation matrix of low rank. 

The advantages of optically implementing the FKT and LSLMT are parallel and 
real-time processing. In comparisons with the matched filter and the average filter, 
the LSLMT is clearly superior for classifying statistical patterns because it maxi- 
mizes the interclass differences and minimizes the intraclass variations. 
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Table 1. 

Eigenvalues Corresponding 

to Ten Basis 

Functions of FK.T 

Basis 

Function 

Eigenvalue (Xp 

\\ - 0.5| 

Best for 
which class 

l 

0.1354 

0. 3646 

fish 

2 

0.4846 

0.0154 

fish 

3 

0.0008 

0.4992 

fish 

4 

0.0034 

0.4966 

fish 

5 

0.9416 

0.4416 

birds 

6 

0.9883 

0.4883 

birds 

7 

0.9966 

0.4966 

birds 

8 

0.9998 

0.4998 

birds 

9 

0.9992 

0.4992 

birds 

10 

0.9985 

0,4985 

birds 


Table 2, The measured values of V.. , V, and V* for the eight test patterns shown 
in Figure 12. 


V V, V_ Largest for 

which class? 
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1.58 

0.02 

0.12 
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"2 

0.93 

0 

0.12 
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0.02 

1.1 

0.07 
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C 2 

0.07 

1.52 

0.01 

t 

C 3 

0.05 

0.95 

0.02 

t 

a l 

0.02 

0.01 

0.88 

a 

a 2 

0.03 

0.01 

1.43 

a 

a 3 

0.02 

0.01 

1.22 
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Figure 1.- The hybrid system for optical recognition of statistical patterns 
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Figure 2.- (a) Class 1 training set consisting of ten song birds. (b) Class 2 

training set consisting of ten fish. 
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(a) (b) 


Figure 3.- (a) F-K basis functions. Basis function 3 is best for class 2 (fish), 
and basis function 8 is best for class 1 (birds) . Small square to the left of 
image number indicates grey level corresponding to zero. (b) Test images 
consisting of five new birds and five new fish. 



Figure 4.- Comparisons of basis functions 3 (fish) and 8 (birds) with filters 
formed by the arithmetic average of the training sets. 


156 






< *3 > 



Figure 5.- Hybrid implementation of the coded-phase optical processor. A computer 
hologram of a coded-phase array is shown as CGH//1. A second computer hologram 
containing the F-K basis functions in coded-phase form is shown as CGH//2 . LCLV 
is a liquid crystal light valve for converting an incoherent (test) image into 
a coherent image. The resultant F-K coefficients are detected by the vidicon 
and analyzed by a digital computer. With the same CGH#2 but new test images, 
new F-K coefficients are obtained and new classifications are achieved in 
real time. 
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(c) 


Figure 6.- Optical implementation of the F-K transform using the coded-phase 
optical processor. The six basis functions with greatest separation power 
were used. Basis functions 3, 4 and 1 are best for fish, and 8, 9 and 10 
are best for birds. Three of the ten images are used as inputs in (a) , (b) & 
(c) , where the input is reproduced in the upper left-hand corner of the 
T.V. screen. The output of the optical processor is detected by a T.V. 
camera, displayed in the lower right-hand corner of the screen, measured 
by the video digitizer, and graphed in the lower left. A linear classifier 
using basis functions 3 and 8 is shown in the upper right-hand corner of 
the screen, with the dotted line separating birds (below and to the right 
of the line) from fish (above and to the left of the line). 
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COEFFICIENT OF BASIS FUNCTION NUMBER 8 


Figure 7.- Classification of birds and fish using coefficients of basis 
functions 3 and 8. Ten members of each training set, as well as ten 
members of the test set were classified* 



(a) (b) 


Figure 8.- The schematic diagrams of LSLMT for 3 classes. (a) N-dimensional 
feature space. (b) Three-dimensional decision space. 


m 



aaa o* & cl o* & a o- m 


Figure 9.- (a) Class 1 training set consisting of ten characters of m. (b) Class 
2 training set consisting of ten characters of t. (c) Class 3 training set 
consisting of ten characters of a. 



Figure 10.- Three pattern functions fpq for LSLMT. 


Since the pattern functions 


fpq contain both positive and negative values in general, bias levels (indi- 
cated by the small grey squares below fpq) are added to fpq to display them. 





Figure 13.- The result of optical implementation of the LSLMT for 3 classes. 
Three of the eight test images used are shown on the left. The outputs of 
the optical processor are displayed on the right. 



Figure 14.- Classification result with LSLMT for three classes (m,t,a) . The 
solid symbols of triangle, circle and square are for the training sets. 

The hollow symbols of triangle, circle and square are for the test images. 
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Figure 15.- (a) A test class of m and t characters. (b) Classification using 
the matched filter for 2 classes (m,t). The solid line represents a re- 
strictive linear classification boundary. The error rate is 9 out of 30, 
or 30%. 
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Figure 16.- Classification result with LSLMT of 2 classes (m and t characters) 





0.5 1.0 V| 


Figure 18.- Classification using average filters for 2 classes (m,t). The 

error rate is 6 out of 30, or 20%. 



j 




Figure 19.- The optical implementation of the LSLMT for two classes (Song Bird 
and Fish). The left column shows two of the six test images as input, the 
middle column shows the outputs, the linear classifier is shown in the 
right column. 





(a) 



COEFFICIENT OF BASIS FUNCTION NUMBER 8 


(b) 


Figure 20.- (a) Results of optically implementing the LSLMT for two classes 
(Song Bird and Fish) . (b) Results of optically implementing the Fukunage- 

Koontz transform using coefficients of basis functions 3 and 8, 
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OPTICAL SPATIAL INTEGRATION METHODS FOR 


AMBIGUITY FUNCTION GENERATION 
P.N. Tamura, J.J. Rebholz, O.T. Daehlin, and T.C. Lee 
Honeywell Corporate Technology Center 
Bloomington, MN. 55420 

SUMMARY 


An optical spatial integration approach to ambiguity function generation has 
been developed. It uses acousto-optic Bragg cells as input transducers and a special 
passive optical element called a space variant linear phase shifter to create time- 
delayed versions of one of the signals. Real time processing has been achieved. 


INTRODUCTION 


The ambiguity function x( v > T ) f° r two given signals f^(t) and ^2^ 
is defined by: 

/ oo 

fj.(t) f (t-x) exp (-j 2-rrvt) dt. (1) 

.00 

The ambiguity function is widely used in the processing of radar and sonar 
signals. A received signal f ^ (t) will differ from a reference signal fq (t) by a 
time delay and a Doppler shift corresponding to the range and radial velocity com- 
ponent of a target. When the v and x variables in (1) take on the appropriate values 
to compensate for the delay and Doppler differences between fp(t) and f 2 (t) > the 
ambiguity function will yield a peak value at the specific position (v,t) in the 
range-Doppler plane. 

Equation (1) imposes a severe computational load on digital signal processing 
hardware, especially in surveillance applications where timely computation of large 
numbers of ambiguity functions calls for near real time processing. Consequently, 
many optical schemes to perform the ambiguity calculation have been investigated 
over the years. These include both spatial integration approaches'^^ ? and more 
recently, time integrating architectures that utilize acousto-optic Bragg cells as 
input transducers . ' 

In this paper, a coherent spatial integration optical processor architecture 
that computes the ambiguity function will be described. The results of real time 
processing will also be shown. 


PROCESSING ARCHITECTURE 


Inspection of equation (1) shows that the operations to be performed by the 
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sign is opposite* 

Therefore, the space variant linear phase shifter can be accurately fabricated 
by cementing a cylindrical lens and a spherical lens of opposite power together, and 
orienting them at 45°. The focal length of the cylindrical lens should be half that 
of the spherical lens. 


ACOUSTO-OPTIC BRAGG CELLS 


The key element in any optical processing system is the input transducer which 
converts an electronic signal in time to an optical signal in space. The fastest 
such device known is the acousto-optic Bragg cell, which receives a temporal signal 
modulated on an RF carrier which propagates through the crystal as a shear wave, and 
space-modulates an incident optical beam. Speed is a critical factor in ambiguity 
function calculation, and since digital signal processor throughput will soon 
approach the video rates at which most two-dimensional spatial light modulators 
operate, Bragg cells, which have a frame time of 50 ys , are a very appropriate choice 
as input transducers. In addition to having a fast frame time, Bragg cells also have 
the advantages of electronic data composing and high diffraction efficiency. 

The Bragg cells used for all experiments were Te02 devices that operate in the 
slow shear mode with 50 ys frame times. The center frequency was 56.5 MHz with 
40 MHz bandwidth at the 3 dB points, giving a time-bandwidth product of 2000. They 
were designed to operate at 514.5 nm. 


REAL TIME PROCESSING 


The most Important design criterion in the real time system is the system time- 
bandwidth product (TBW) . An acoustic shear wave in a TeO^ Bragg cell will propagate 
with a speed of 617 meters/sec, while the practical physical window size is 30 mm. 

It takes only 48 ysec for the acoustic wave to propagate through the window. A 
sufficient system TBW, therefore, requires a fast data rate and high speed DACs. 

Fast 8 bit DACs commercially available operate at a 50 MHz data rate, and a digital 
time base compression (TBC) buffer of comparable speed must be constructed to operate 
with it. 

The real time system is designed around the optical ambiguity function genera- 
tor that utilizes two Bragg cells and the LIPS element. The front end electronics 
consisting of a TBC buffer, two digital-to-analog converters (DACs), and two RF 
driver/modulators, receives the input signals from the HP 9825 calculator In digital 
form and converts them to analog signals which cause index of refraction modulations 
in the Bragg cells which are proportional to the input signals. The rear end 
electronics, consisting of a two-dimensional detector array camera and a video 
memory, captures the ambiguity function, displays it on a CRT, and digitizes and 
stores it for examination. A block diagram of the real time system is seen in 
Figure 3. 

The processing rate of the real time system is limited by the detector array 
read out time to 500 frames/sec. Also, the HP 9825 cannot reload the TBC buffer fast 
enough to create 500 different frames each second. Instead, the same frame is 


168 



optical signal processor are a conjugation operation, a multiplication of f.. (t) by 
f 2 (t-x), and a one dimensional Fourier transform along the Doppler axis. These 
operations can easily be performed in the space domain by Fourier optics techniques, 
and an optical architecture capable of performing them is shown in Figure 1* 

The first operation, conjugation, is performed by placing each signal on a 
carrier frequency and passing the opposite diffracted order from each Bragg cell. 

This heterodyning with a carrier is necessary anyway since the signal data is invari- 
ably in complex form from a previous basebanding operation. The second operation, 
multiplication, is performed by imaging Bragg cell I telecentrically on to Bragg 
cell II. Finally, lens S5 takes a one-dimensional Fourier transform along the 
frequency axis to yield the ambiguity function. This architecture has been analyzed 
in formal Fourier mathematics in a previous paper 

The time-shifted version of £ 2 ^) is created by using a passive optical element 
called a space variant linear phase shifter (LPS) . The action of this LPS element 
can be understood by considering that its horizontal cross section is that of a 
prism. The result of imaging Bragg cell I onto Bragg cell II through a prism is to 
cause a position shift of the image. Since this is a spatial integration approach, 
a misregistration in the space domain is equivalent to a time shift in the time 
domain, and a delayed version of £ 2 ^) will be created. By continually varying 
the wedge angle of the prism along the vertical axis, a continuous range of delay 
values can be created. 

The feasibility of implementing the optical architecture of figure 1 depends 
heavily on the manufacturability of the linear phase shifter element. It is essen- 
tially an optical wedge; its wedge angle changes linearly with height. The complex 
transmissivity function of this component is given in rectangular coordinates by 

g(x,y) = e j “ y (2) 

where a is a constant. 


Conventional manufacturing processes such as grinding and polishing a glass 
piece would be difficult if not impossible to apply to the fabrication of such an 
element. We have invented a method to fabricate this component out of conventional 
optics, hence high accuracy of the transmitted wavefront is possible. 

By modifying equation (2), we have 

.a f \2 .a, 2 2. 

i~2 (x+y) -j; 2 ( x + y ) 

g(x,y) = e e (3) 

where the decomposition of equation (3) is gssentially the same as that used in the 
chirp-z algorithm. If we define r = x^ + y and introduce a coordinate system 
(x ! ,y ? ) that is rotated from (x,y) by 45° (figure 2), equation (3) can be rewritten as 

g(x,y) = e j ax ' 2 e _j 2 r2 . O) 


The first exponent in equation (4) is the complex transmissivity function of a 
cylindrical lens oriented parallel to the x 1 axis. The second exponent is a spher- 
ical lens. The cylindrical lens is twice as powerful as the spherical lens, and the 
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repetitively read to the Bragg cells to demonstrate high speed processing. 

The test signals used to demonstrate the real time processing capability of 
this system were synthetically produced complex random signals with varying amounts 
of noise added. The time bandwidth product was selected to be 256. The output of 
the 100 x 100 detector array camera was plotted in an isometric display format. One 
such plot for an input signal-to-noise ratio of 0 dB (signal power equal to noise 
power) is shown in figure 4. The optically produced results compare favorably with 
digital calculations, and their resolutions along both the delay and Doppler axes are 
in agreement with theoretical predictions. 


CONCLUSIONS 


A coherent spatial integration approach to ambiguity function generation has 
been described. It uses one-dimensional acousto-optic Bragg cells in conjunction 
with a space variant linear phase shifter optical element to generate the two- 
dimensional ambiguity function in one exposure. A real time implementation of this 
system has been demonstrated. 
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ABSTRACT 


The performance of a fully integrated rf optical spectrum analyzer (I0SA) is 
presented. This device exhibited a 3 dB bandwidth of 380 MHz with a diffraction 
efficiency of 5% (at 500 MW rf power) . The fully integrated device operating at 
X - 0.82y exhibited the following performance figures: an acousto-optic bandwidth 

o? 380 MHz, a 3 dB spot size of ^8 MHz, a two-tone resolution of 8 MHz, and a linear 
dynamic range of >25 dB as observed visually on an oscilloscope. The resolution 
uniformity across the rf bandwidth is presently limited by the mechanical position 
errors of the semiconductor laser and the detector/CCD array in the focal planes of 
the collimating and focusing geodesic lenses. 

All components used in the I0SA were first tested individually and reported here, 
A fully functional (with no dead detector cells) detector /CCD array exhibited a 
linear dynamic range of 30 dB, with a sensitivity uniformity of ±1 dB. A measurement 
of the undeflected zeroth order beam indicates geodesic lens insertion losses of 
<2 dB each. The acoustic transducer exhibited a 400 MHz 3 dB rf bandwidth. 


INTRODUCTION 


The use of guided-wave optics resulted in the development of a new class of one- 
dimensional optical processors that have a rigid, compact optical design and a low 
electrical drive power requirement. This type of processor is most easily applied to 
the analysis of rf signal spectra. In this paper we describe an integrated-optics, 
circuit for real-time rf spectral analysis (ref. 1,2). 

An integrated-optics optical spectrum analyzer consists of an injection laser 
diode, a thin-film optical waveguide, waveguide lenses, a surface acoustic wave (SAW) 
transducer, and a linear detector array. This device employs the interaction between 
a coherent optical wave and an acoustic wave driven by an input electrical signal to 
determine the power spectral density of the input. The basic design layout of an 
integrated-optic spectrum analyzer (IOSA) is shown in Figure 1. An incoming radar 
signal is mixed with a local oscillator so that the intermediate frequency (IF) is 


173 



within the passband of the transducer. After amplification, the signal is applied 
to the SAW transducer. The resultant SAWs traversing the optical waveguide generate 
a periodic modulation of the refractive index of the waveguide mode. If a collimated, 
guided optical beam intersects the acoustic beam at the Bragg angle, a portion of the 
beam will be diffracted (deflected) at an angle closely proportional to the acoustic 
frequency, and with an intensity proportional to the power level of the input signal. 
The Bragg deflected light is then focused onto an array of focal plane detectors 
where each detector output becomes one frequency channel of the spectrum analyzer. 


ACOUSTO-OPTIC INTERACTION 


A diagram of the acousto-optic interaction region in a monolithic integrated 
optic structure is shown in Figure 2. The criterion for an efficient Bragg diffrac- 
tion is that the interaction length L be sufficiently large so that the parameter Q 
is larger than 1; i.e., 


Q = 


LXo 


> 1 


nA cos 


where n is the waveguide modal index, A is the acoustic wavelength, Xo is the optical 
wavelength, and 0^ is the incident angle of the optical beam defined in Figure 2. 

This criterion physically corresponds to an interaction length larger than the 
acoustic wavelength. The diffracted energy appears predominantly in one order. 

When the acousto-optic interaction is phase matched in first order Bragg diffrac- 
tion, the angle of incidence, 0., and the angle of diffraction, 0^, are equal. The 
angular deflection 6 (the angle between the incident and diffracted beam, or twice 
the Bragg angle) is given by 


6=0. + 0 , =2 sin 
l d 


-1 



This angle is different for each acoustic wavelength or frequency. For a fixed inci- 
dent optical beam direction, the direction of the acoustic wave must change with 
frequency. Within the small angle approximation, the deflection angle is linearly 
related to the acoustic frequency by 



df nv 


s s 


where f is the acoustic frequency, and v is the velocity of the surface acoustic 
wave. S When the acousto-optic medium exhibits significant dispersion in v the 
deflection equation takes the form 
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This indicates that nonlinear spacing of detector elements in the focal plane must be 
used for dispersive acoustic materials. 

A complete Fourier transform analysis for this device was reported earlier 
(ref, 2). In this device the deflected optical beam generated in the Bragg cell 
modulator is proportional to the product of the incident optical beam wavefront, the 
accoustic wavefront, and the efficiency of the interaction. Therefore, as a result 
of the acousto-optic interaction, a fraction of the transmitted light beam acquires 
a new direction of propagation, corresponding to a bandwidth, df , centered at frequency 

f . S ' 

s 

The output intensity distribution thus displays, at a position linearly propor- 
ional to the frequency of the modulating signal, a signal whose intensity is propor- 
tional to the intensity of the deflected beam (i.e., proportional to the spectral 
component of frequency f , multiplied by the efficiency of the Bragg modulator). For 
a modulator with a flat response over the frequency range of interest, the output is 
thus proportional to the intensity of the spectral component. 

DESIGN PARAMETER STUDY 


The optical circuit design of an IOSA device is constrained by the required rf 
frequency resolution and the optical resolution limit set by the allowed crosstalk 
between adjacent cells. The rf frequency resoltuion requirement sets the upper limit 
of the detector cell size S 


FA 

S < Af ; 

~ nv 

s 


while the optical resolution limit requires a minimum cell size determined by 


S > 


gX o F 

nD 


where D is the optical beam width, F is the output lens focal length, n is the effec- 
tive guide index, and A is the free-space wavelength. The resolution factor g is 
determined by the definition of optical focal spot resolution. These equations have 
been used to design an IOSA structure having 4 MHz rf resolution (ref. 2) for an IOSA 
fabricated on an LiNb 03 substrate, where the acoustic phase velocity is 
v s = 3500 m/sec. For this device with a Gaussian beam truncated at the 1/e^ 
intensity points, the minimum beam width was 2.18 mm. 


175 



When the minimum optical beam width, D, is selected in the design, then the 
spot size of the optical beam at the detector array will just equal the detector cell 
size S. The adjacent channel crosstalk is defined as the total optical power col- 
lected by the cell adjacent to a cell corresponding to a given rf frequency divided 
by the total optical power collected by that rf frequency cell. This adjacent chan- 
nel crosstalk ratio is plotted in Figure 3. The adjacent cell crosstalk is plotted 
as a function of DS. Thus, for an optical beam width of 2.18 mm, and a cell size of 
8.0 ym (equal to the full lobe spot size at the focal plane), the crosstalk is 
-22.4 dB . If the lens quality is such that a lens diameter larger than D m _^ n can be 
accommodated while diffraction-limited performance is still maintained, then the side- 
lobe intensity impinging on adjacent cells will be reduced. This will allow two 
closely spaced signals of significantly different energy levels to be detected and 
resolved. For a 4 MHz resolution and a cell size fixed by the full main lobe width 
of the minimum diameter beam (D m j_ n = 2.18 mm), a 5.6 dB reduction in adjacent chan- 
nel crosstalk results when the optical beam width is enlarged to three times this 
minimum value, or D = 6.54 mm. For this case, the adjacent channel crosstalk is 
-28 dB. 

We have established that the value of Sn/A F necessary to achieve 4 MHz rf 
resolution is 


Sn , in +2 -1 

t — — = 11.4 x 10 m 
A r 


where the ratio S/F for an LiNbO system (n - 2.2) operating at A = 0.82 ym is 
S/F = 4.25 x 10“^. Since the detector cell size, S,is set by detector array design 
considerations, the focal length of the Fourier transform lens is required to be 
F = (S/4.25) x 10^, which, f or an 8-ym cell size, is 18.8 mm. If the optical beam 
diameter is chosen to be 7.4 mm, then the f/no. of the output transform lens is 
f/2.54. For this system, the adjacent cell crosstalk due to the optical beam diffrac- 
tion pattern is below -28 dB. 


PROPAGATION MEDIUM 


An important issue in the development of an I0SA was the selection of the 
optimum substrate material system from the available candidates. The material 
selected for this device was a y-cut LiNb 03 substrate with propagation direction along 
either the z axis or 21.8° from the z axis. LiNb 03 is considered to be one of the 
best acousto-optic materials because of its low acoustic propagation loss, high 
electromechanical coupling constant, and the relative absence of velocity dispersion 
due to the optical guiding layer. With this substrate material the detectors are 
interfaced by direct butt coupling to the LiNbO^ waveguide. For y-cut crystals, 

LiNb 03 substrates have different SAW characteristics for propagation along the z axis, 
and for propagation 21.8° off the z axis (ref. 3). An analysis of acousto-optic 
modulation efficiency indicated that there is no significant difference between these 
orientations. From the optical standpoint, the z-oriented acoustical transducers 
minimize the substrate anisotropy and are preferred. 
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The waveguide on the polished y-cut LiNbOo substrate is formed by in-diffusion 
of electron-beam-evaporated Ti (ref. 4). A fabrication sequence is used in which 
the important parameters (such as surface quality, optical flatness, cleanliness, 
uniformity, and adherence of the metal films to the substrate) are controlled through- 
out the process. Single-mode optical waveguides are formed by diffusing 200-A- thick 
Ti at 950°C for 4 hours. The diffusion depth of the resulting waveguide is about 2 pm. 

The propagation loss in Ti:LiNb 03 waveguides at the He-Ne laser wavelength 
(A = 6328 A) is typically less than 1 dB/cm. The waveguide loss for a TE polarization 
wave (He-Ne laser wavelength) was experimentally determined to be 0.85 dB/cm (ref. 2). 

In-plane scattering of the optical waveguide is an important parameter in 
determining the dynamic range of the I0SA. He-Ne laser light was used to excite the 
waveguide mode and the strength of the in-plane scattering was measured by scanning 
the m-line extracted by a coupling prism. The scattering intensity shown in Figure 4 
is -40 dB for a scattering angle of 0.6° in the waveguide, corresponding to an 
acousto-optic deflection angle due to an acoustic frequency of 115 MHz. In-plane 
scattering levels of -49 dB and 57 dB of the peak intensity were measured at an 
in-plane scattering angle of 1.3° (corresponding to deflection by a SAW having a 
frequency of 270 MHz) on each side of the main beam. When the I0SA is operated at 
the GaAlAs laser wavelength, the in-plane scattering level is further reduced. 


LASER CHARACTERIZATION 


To meet the frequency resolution specification, the total output spectral width 
of the laser should be much less than 40 A under operating conditions. Ideally, the 
output should be in a single transverse and single longitudinal mode with spectral 
width <1 A. The lasing aperture should be as small as possible to simulate a point 
source so that the problem of collimating the laser output in the thin-film waveguide 
can be simplified. Based on these considerations, we selected the Hitachi GaAlAs 
buried-heterostructure injection laser diode as our source. The lasing aperture 
of the device is 'Ul ym in diameter with nearly isotropic divergence. In this pro- 
gram we tested two special-order lasers that have an emitting edge flush with the 
heat sink edge. They are double-heterojunction lasers, Model HLP2400, Serial Nos. 3831 
and 3834. The far-field intensity distribution in the two principal orthogonal 
directions (perpendicular and parallel to the junction) was measured using a photo- 
multiplier and a slit with an effective angular resolution of 0.115°. Figures 5 and 
6 show the far-field intensity profile measured at HRL. The beam divergence in the 
plane parallel to the junction is less than 20° for both lasers. In the plane normal 
to the junction, the beam divergence is 30° (Serial No. 3834) and 37° (Serial 
No. 3831), respectively. 

There is an upper limit in the laser power coupled into the waveguide at which 
the detector cell is saturated. The laser power requirement can be estimated based 
on the following values: a saturation charge density of the detector cells of 

6.0 x lO^ electrons/cell, a detector quantum efficiency of 0.25, a maximum acousto- 
optic deflection efficiency of 0.05, an optical propagation loss of 9 dB , and a 
detector integration time of 3.0 ysec. The calculation shows that about 600 yW of 
laser power has to be coupled into the waveguide. 
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WAVEGUIDE GEODESIC LENSES 


Waveguide geodesic lenses are waveguide-covered depressions (or protrusions) . 

The curvature of the depression generates a difference in geometric path length for 
rays traversing different portions of the structure; this produces a focusing effect. 
The optical path length at the center of the depression is longer than at the sides; 
and the phase fronts are curved toward the lens axis beyond the depression. If the 
depression is of a particular aspheric shape, or if the wavefront emerging from the 
line is appropriately compensated by external over layers, the focusing will be aber- 
ration-free (except for field curvature) . 

The design of the aspherical depression profile for a diffraction-limited 
geodesic lens is based on the equivalent-lens principle (ref. 5). From a generalized 
Luneburg lens profile calculated by numerical computation, one can derive the equiva- 
lent aspherical geodesic lens profile. The geodesic lens has no chromatic aberra- 
tions because of its uniform refractive index. This is independent of dispersion in 
the material refractive index. The focal length of the geodesic lens is independent 
of the modal index, and it can be used in both single- and multi-mode optical 
systems , 

Diffraction-limited geodesic lenses have been demonstrated in two different wave 
guide structures: polyurethane film-glass substrate (ref. 6) and Ti in-diffused 

LiNbOg waveguide (ref. 7). The focal-length measurements (ref. 7) indicate that the 
lens is aberration free up to about 90% of the aperture. Two waveguide geodesic 
lenses are used in the I0SA: one collimation lens and one Fourier transformation 
lens. Both lenses are identical, and their parameters are listed in Table 1. Since 
the aspherical lens profile has a smooth transition region that takes up 15% of the 
lens aperture, the useful lens aperture is 7.4 mm. 


FABRICATION OF ASPHERICAL DEPRESSIONS 


The fabrication of geodesic lenses involves two stages; grinding and polishing 
the lens depression with high precision. Ultrasonic impact grinding has been 
selected for both of these operations. The key problem appearing in the optical 
polishing is to obtain an optical surface without destroying the complex lens profile 
in particular, the edge rounding region. 

Ultrasonic impact grinding employs a tool bearing the shape of the designed 
depression. Ultrasonic energy transmitted through the tool agitates the abrasive 
slurry, and material is removed from the sample surface. The ultrasonic impact grind 
ing technique is capable of replicating the tool shape within a 1/4-pm tolerance 
(ref. 6). The operation requires more than one tool to reduce the effects of tool 
wear . 


Ultrasonic impact grinding is a complicated process governed by many parameters. 
We have empirically determined the optimum values of tool shapes, tool materials, 
grinding pressures, stroke amplitudes, workpiece materials, slurry speeds, abrasive 
concentrations, and grit sizes for our specific lens designs. Double lens pairs 
have been ground on 6 . 35 cm x 3 . 81 cm x 0.318 cm (x-z-y) LiNbO^ substrates. After 
polishing and diffusion of Ti to form planar waveguides, we experimentally determine 
the location of the focal planes of the lenses and polish the edges to coincide with 
the focal planes. 
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LENS TOLERANCE ANALYSIS 


Recently, Betts et al (ref. 8) published expressions for tracing rays in 
rotationally symmetric geodesic lenses. These expressions have been used to cal- 
culate the effects of tool wear, and lens depth. 

The profile used in the calculations is for an f/1.5 lens. Some of the results 
of these calculations are given in Figures 7 and 8. Figure 7 shows the focal length 
increase, with percentage depth for both the tool wear analysis and the decrease in 
depth while maintaining the actual profile (surface removal) . 

For an f/1.5 lens, the diffraction-limited spot size is .58 pm at a wavelength 
of 0.8 pm in LiNb 03 . For a 1-cm-diam lens, this corresponds to 3 x 10~^ units in 
Figure 8. Hence, to maintain diffraction-limited performance, a depth change of 
^1% can be tolerated if 90% of the original useful aperture is used. For greater 
tool wear, a smaller aperture is required to maintain a diffraction-limited per- 
formance. This was experimentally demonstrated with a lens fabricated in LiNb 03 , 
where an 80 % aperture gave near-diffraction-limited performance. 

From the experimental and calculated results, we concluded that in any system 
fabricated using geodesic lenses the precise location of the focal plane could only 
be determined after the lens fabrication. The focal position is very sensitive to 
fabrication tolerance, but the focal spot size remains diffraction limited within 
achievable fabrication tolerances. 


SAW TRANSDUCER DEVELOPMENT 


During this phase of the program we developed a 500-MHz bandwidth, 1-GHz operat- 
ing frequency SAW transducer suitable for use in the IOSA, and we acoustically charac- 
terized the Ti:LiNb 03 structure. Experiments were carried out to measure propagation 
loss, velocity dispersion, and acousto-optic deflection efficiency on various 
Ti:LiNb 03 substrates. A modified chirp transducer (MCT) was able to achieve acousto- 
optic deflection bandwidths exceeding 400 MHz for both HeNe and GaAlAs laser wave- 
lengths . This is the largest acousto-optic deflection bandwidth reported to date 
for a single transducer at either laser wavelength. 

The MCT is similar to a conventional chirp transducer in that it has electrode 
spacings that vary smoothly from one end of the transducer to the other. This 
ensures that a portion of the transducer electrodes is synchronous at all frequencies 
within the passband. The chirp transducer response is relatively smooth over its 
bandwidth. 

The total number of electrodes in the chirp transducer can be changed to facili- 
tate impedance matching with virtually no effect on the shape of the output spectrum. 
In fact, the number of electrodes can be set so as to yield the ideal transducer loss 
with no external matching circuit whatsoever. In the case of the chirp transducer, 
each interelectrode gap has a characteristic ("instantaneous") synchronous frequency 
and it can be tilted to satisfy the Bragg condition at that frequency. Figure 9 
shows the modified chirp transducer configuration with tilt angle varying smoothly 
over the transducer band. At any frequency within the band, a small number of elec- 
trodes are in quasi-synchronism, while contributions from the remaining electrodes add 
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in virtually random phase. For this reason, SAW generation is confined to a single 
"active" region that slides along the transducer as the drive frequency is changed. 
The number of excitation electrodes N a comprising an "active" section for a linear 
FM chirp transducer is given by 


N (f) 
a 


N 

JL TOT 
f o \Af Ax 


where N T q T - 2f 0 At is the total number of electrodes, f Q is the midband frequency, 
and At is the transducer length divided by the SAW velocity. The total number of 
electrodes can be varied to satisfy convenient impedance-matching conditions. When 
the transducer aperture is fixed by acousto-optic interaction length considerations, 
NfOT can be increased until the midband capacitive reactance reaches 50 Q. At this 
point, the transducer can be coupled to a 50 fi generator without any tuning. In 
addition, losses due to parasitic resistance in the metal film can be reduced by 
making N a large, since the electrode resistance term is inversely proportional to the 
number of active electrodes. 


The transducer pattern ultimately used in the development of the IOSA was a 
modified chirp transducer operating at 600 MHz with a 400-MHz bandwidth. This trans- 
ducer, shown schematically in Figure 9, consists of 180 electrodes with N a = 21 
arranged in a two-track "dog-leg 11 structure to improve the impedance matching charac- 
teristics. The total aperture is 76 wavelengths, and the total tilt angle variation 
is 1:7°) since this design covers the frequency range from 350 to 900 MHz. The trans- 
ducer photomask pattern with non-parallel electrodes was programmed on a computer- 
controlled optical plotter and photographically reduced in 3 stages. The devices 
were then built using Q Contact optical photolithography. In all cases, the transducers 
were etched from 500-A- thick aluminum metalizations . 


PERFORMANCE 


Data was obtained on acoustic conversion efficiency, propagation loss, velocity 
dispersion, and acousto-optic deflection efficiency. 

The 600-MHz MCT pattern was initially fabricated on untreated yz LiNbC^. Mea- 
surements were made of the insertion loss of this transducer in a non-dispersive 
delay-line configuration and are shown in Figure 10. The measured conversion effi- 
ciency agreed quite well with the theoretical value of 13 dB. The insertion loss 
varies from 14 dB at the lower edge of the band to 12 dB in the middle. Varying the 
length of the gold wire leads causes the passband to tilt by as much as 5 dB. Excel- 
lent agreement between the theoretical and the measured response was achieved. 

Using test delay lines fabricated on Ti-dif fused LiNb 03 samples, measurements 
were made of acoustic propagation losses. The resulting data is shown in Table 2. To 
obtain this data, it was assumed that the measured insertion loss of the untreated 
samples consisted of the sum of the conversion efficiency and the theoretical propaga- 
tion loss predicted by Szabo and Slobodnik (ref. 9). 

There is great disparity in the propagation loss data obtained. An attempt was 
made to correlate the attenuation values with the temperature profiles used during 
the Ti diffusion. The times at each temperature are also shown in the table. 
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Sample N-92, which exhibited essentially theoretical propagation loss values, was 
heat treated at 650° C for 2 hours and at 950°C for 5 hours. The higher losses mea- 
sured for sample N-95 indicate that the time at both 650°C and at 950° C are critical 
in determining the propagation loss of the samples. 

The layered Ti:LiNb 03 structure is expected to exhibit some acoustic velocity 
dispersion. It is important to characterize the amount of dispersion because the 
frequency dependence of the optical deflection angle 6 and hence of the detector 
spacing is dependent on velocity. 

The dispersion of the acoustic time delay was measured for crystals with differ- 
ing diffusion temperature profiles. The results are given in Table 3. These results 
were obtained from a computer-controlled rf network analyzer that measured the inser- 
tion phase of a delay line consisting of two 600-MHz MCTs and performed a second- 
order least-squares fit. The values of 1.6 x 10“^/MHz measured for these devices 
corresponds to a change in time delay of 0.64% for a 400-MHz bandwidth. However, of 
more importance is the error in position of the detector cells because they are 
uniformly spaced. The fractional position error of the high-frequency end of the 
detector can be calculated relative to the low-frequency end. The result is that for 
a 400-MHz bandwidth centered at 600 MHz, the error is 1%, or 1 detector cell width. 

Acousto-optic deflection data was obtained for devices consisting of TiiLiNbO^ 
substrates with 600-MHz MCT transducers. The setup shown schematically in Figure 11 
was found to be the most convenient and the most accurate method for obtaining this 
data. The light beam from the laser is coupled into the waveguide using a rutile 
prism coupler. The guided optical beam traverses the acoustic path, and the 
undeflected and deflected beams emerge from the waveguide at the edge of the crystal. 
There they are imaged on solid-state detectors using a cylindrical lens. The deflec- 
tion efficiency is then determined by taking the ratio of the two optical intensities. 
The SAW transducer is driven by an rf oscillator and amplifier. The incident rf 
power can be monitored using a power meter. The primary advantage of this technique 
is that no output prism coupler is required. The main disadvantage of this technique 
is that a chip-free polished edge must be obtained on the output edge of the LiNbC >3 
substrate. Otherwise, the light in the waveguide is scattered and cannot be properly 
imaged on the detectors. 

Figures 12 and 13 show acousto-optic deflection efficiency versus frequency mea- 
sured with 100 mW of rf power incident on the SAW transducer. 

Figure 12 shows the response obtained using a GaAlAs double-heterojunction laser 
with a TE polarization, and with a wavelength of 0,83 ym. The 0 dB here corresponds 
to a 1.6% deflection efficiency; the transducer bandwidth is 420 MHz. 

As proof of the effectiveness of the tilted fingers to steer the acoustic beam 
and affect the bandwidth, a measurement was made on the deflection coefficient when 
the laser was incident from the opposite side. This is equivalent to changing the 
sign of the direction in which the electrodes tilt. The result is a response that 
is significantly narrowband compared with that shown in Figure 12. Using Equation (15) 
(of Ref. 10), the shape of this response can be calculated; it is shown in Figure 13, 
along with the curve measured. The two curves agree quite well and the response is 
significantly narrowband relative to the parallel electrode case. The slight dis- 
placement in frequency between the two curves results from the experimental uncer- 
tainty concerning the frequency at which the Bragg condition was exactly satisfied. 
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DETECTOR ARRAY AND CCD READOUT 


The detector array of the IOSA is designed to detect SAW deflected laser light, 
to provide information on incoming rf frequency, and to read out with a charge- 
coupled-device shift register. 

The CRC 139 detector array is made of a single row of 100 silicon MOS photo- 
detectors and 4 output CCD shift registers located on both sides of the detectors. 
These small photodetectors are made of MOS capacitors with semi-transparent poly- 
silicon gates. The CCD shift register is fabricated using polysilicon gate buried- 
channel CCD technology to reduce noise and increase the rate of charge transfer. 

The CCD is fabricated with n-channel CCD technology to take advantage of higher elec- 
tron mobility. The schematic cross section of a buried p-channel MOS photodetector 
compatible with the above n-channel CCD technology is shown in Figure 14. 

When a bias voltage is applied on the gate of the MOS capacitor, a depletion 
region is formed under the polysilicon gate. Electron-hole pairs generated by photons 
penetrating the silicon substrate will be separated by the electric field in the 
depletion region. The electrons collected at the Si-Si02 interface become the signal 
charge . 

The primary performance goals of the detector array were: 
m An rf frequency resolution of 4 MHz 
m A dynamic range of 40 dB 

® A total sampling time of <3 ysec. 

The detector cell width S is constrained by the required rf frequency resolution 
and by the allowed optical crosstalk between adjacent cells discussed earlier. Using 
the parameters associated with Figure 3, the detector cell width defined by frequency 
resolution requirements is S < 4.26 x 10“^F, and the cell width defined by optical 
resolution requirement is S > 1.32 x 10“^F. A small focal length F corresponds to a 
small detector size. The detector width, however, is restricted in the lower limit 
by detector-CCD fabrication and coupling considerations , and by dynamic range require- 
ments. The detector coupled to CCD shift registers is shown in Figure 15. To facili- 
tate the charge transfer, it is desirable to have equal width for both the detector 
and the CCD cell. It is difficult to make the width smaller than 16 ym per bit or 
8 ym per detector for a 2 detector to I bit dual multiplexer. There are channel 
stops between the detector cells to isolate the cells from one other. The minimum 
channel stop width that can be made on a mask is 1.5 ym and will be 3.5 ym wide 
after processing. With a spot size of about 2.5 ym, the detector size should be at 
least 7 ym to contain the full light spot. To satisfy the dynamic range require- 
ment of 40 dB requires a rather large detector storage area. Any further reduction 
in detector width will make the detector length even longer and slow down charge 
transfer. Based on these considerations, we chose a detector width of 8 ym for 
the Fourier transform lens with a focal length of 1.88 cm. This gives the required 
frequency resolution of 4 MHz. 

A dynamic range of 40 dB is required for system applications. The dynamic range 
is defined as the range between detector saturation due to strong signal and the over- 
all noise level. The determination of expected noise level is, therefore, very impor- 
tant for designing the full charge capacity of both the detector and CCD unit cells. 
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The usual noise sources of a detector/CCD array can be separated into three 
categories: those associated with the input, the integration and transfer, and with 

the output. These noise effects are listed in Table 4 with special emphasis given 
to the comparison of noise characteristics of surface- and buried-channel CCDs. 


The conclusion that the noise level of a surface-channel CCD is about four times 
higher than that of a buried-channel CCD has led to the selection of the buried 
channel device for this program. To achieve a dynamic range of 40 dB, or 10 ^ to 1, 
the charge capacity of both the detector and the CCD storage well must be at least 
10^ times greater than the noise level. The noise level of a buried-channel CCD and 
its output amplifier is about 210 electrons. A full charge capacity of 6 x 10 ^ elec- 
trons is thus designed for this array. Assuming a charge density of 1 x 10^2 
electrons/cm^, this results in a detector of 

g 

6 x 10 electrons , .. „-6 2 

^2 J = 6 x 10 cm . 

1 x 10 1 electrons/cm 

With an effective detector width of 4.5 ym, this requires a detector length of at 
least 


(6 x 10 cm^) 
(4.5 x 10 ^ cm) 


1.33 x 10 


cm 


133 ym . 


Therefore, the detector size is designed to be 8 ym by 150 ym. 

The 100 detectors should be sampled as fast as possible, preferably within 
3 ysec. Therefore, the detector, the CCD shift register, and the output amplifier 
are all required to have high-speed operation. Three basic mechanisms are responsible 
for carrier transfer in a CCD: thermal diffusion, self-induced drift, and fringing 

fields. A summary showing thermal diffusion time constants, fringing field, and 
carrier mobility is shown in Table 5. An epi concentration of 5 x lO^/cm^ (30 Q-cm) 
and a potential difference of 10 V between adjacent electrodes are assumed. Also 
listed in Table 5 are t^, the time to achieve a transfer efficiency of four 9’s or 
0.9999, and f, the maximum operating frequency assuming a 2-phase clocking. 


With a readout scheme as shown in Figure 15, each of the CCD registers will have 
a maximum of 100 transfers between detector and output amplifier. If a 3-ysec 
sampling time is desired, a transfer time t 4 of less than 3 x 10“8 sec would be 
required. Table 5 shows that CCDs with an electrode length of 7.5 ym or less have 
a very good chance of meeting this goal. The electrode length chosen in the present 
design is 6 ym. 


IOSA DEVICE TESTING AND PERFORMANCE 


The IOSA is a complex device and extensive testing is needed for performance 
evaluation. Looking at the close-up of a fully assembled IOSA in Figure 16 we can 
clearly discern the detector/CCD chip at the top end, the two geodesic lenses in the 
middle, and the semiconductor laser at the lower end of the LiNb 03 integrated optic 
chip. Two sets of SAW transducers can be seen in the middle of the LiNb 03 chip on 
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either side of the axis of the crystal. The significant features of the IOSA’ s 
performance are: 


© Acousto-optic bandwidth 

® Single-tone rf resolution 

® Single-tone dynamic range 

© Two-tone rf resolution 

@ rf pulse measurements. 


Acousto-Optic Bandwidth 

The IOSA device depends on acousto-optic Bragg deflection for its successful 
operation. The bandwidth associated with the acousto-optic interaction depends on 
the rf bandwidth of the transducer for efficient SAW generation, as well as 
k-vector matching of the optical and surface acoustic waves. But a purely rf charac- 
terization of the transducer reveals only a part of the picture. In order to evalu- 
ate the acousto-optic response we make use of an experimental arrangement, 
illustrated by the schematic in Figure 17. To obtain the optimum acousto-optic 
response we need to arrive at the correct position for the semiconductor laser. 

This is decided by measuring the acousto-optic response corresponding to different 
transverse positions of the semiconductor laser with respect to the axis of the 
collimating lens. In this measurement a 1 kHz amplitude modulated rf signal is 
amplified and applied to the SAW transducer of the IOSA. The directional coupler 
between the rf amplifier and the SAW transducer facilitates measurement of the 
incident and reflected rf power levels. The deflected optical spot at the output 
edge of the crystal is imaged onto a photomultiplier. The beam splitter arrange- 
ment allows monitoring of the output edge of the crystal on a video monitor. The 
signal from the photomultiplier, together with the 1 kHz modulating signal from the 
wavetek oscillator, is fed to the signal and reference channels of a PAR 124A lock- 
in amplifier. The analog output from the lock-in is used to drive an XY-plotter. 
Because of the lock-in measurement technique employed, any scattered light from 
the substrate does not cause problems. With this experimental arrangement we have 
been able to rapidly measure the acousto-optic bandwidth for each position of the 
semiconductor laser. 

The optimum acousto-optic response, measured with an HLP 1400 laser coupled to 
the IOSA chip, is shown in Figure 18. The 3 dB bandwidth is ^380 MHz. The acousto- 
optic deflection efficiency corresponding to the 0 dB level at a frequency of 

615 MHz and 100 mW, was measured as the ratio of the deflected spot signal to the 

undeflected spot signal. The deflected spot signal was arrived at by substrating 
out the signal due to any scattered background. For the two samples tested we have 
observed efficiencies of 0.83% (sample 4 with HLP 3400 U laser), and 0.97% (sample 
3 with HLP 1400 CSP laser) , respectively. The irregular structure observed in the 
acousto-optic response of Figure 18 is due to rf mismatch which arises from the 
coaxial cable (RG57, BNC) between the output of the directional coupler and the SAW 
transducer. This has been checked by monitoring the reflected power, keeping the 

incident power at a fixed value of P aV g ^100 mW, and measuring the acousto-optic 

response over the frequency range 400 MHz to 900 MHz. In Figure 19 we have plotted 
the acousto-optic response, as well as the net rf power flowing into the SAW 
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transducer/coaxial cable pair versus frequency. The correspondence between the peaks 
and valleys of the acousto-optic deflection efficiency with the peaks and valleys 
of the net power into the SAW transducer confirms this conclusion. 


Single-Tone RF Resolution 

The IOSA device was designed to have an acousto-optic 3 dB bandwidth of 400 MHz, 
with a center frequency of 600 MHz and a single-tone resolution of 4 MHz. Acousto- 
optic response measurements discussed earlier indicate that the Bragg cell design 
does indeed yield a 3 dB response of ^380 MHz, very close to the > 400 MHz rf 
bandwidth. The single-tone resolution of the device depends on a number of factors 
such as the gap between the laser emission facet and the input end of the LiNb 03 
crystal, the positions of the focal planes of the collimating and Fourier trans- 
forming lenses with respect to the edges of the LiNbC^ crystal, SAW transducer 
alignment, and the distance between the output end of the LiNb 03 crystal and the 
surface of the detector/CCD chip. 

Even though the acousto-optic response was less than optimum, single-tone 
frequency measurements performed on the device are nevertheless interesting. 

A single-tone measurement carried with a 725 MHz 100 mW rf signal is shown in 
Figure 20. The detector/CCD chip is organized in a way that data is read out in two 
channels as strings of 50 pixels each. In order to determine the frequency resolu- 
tion we need to have data from both the output channels properly interleaved on 
the display. The photograph in Figure 20 shows output of all the 100 pixels in 
proper time sequence. The upper photograph is an oscilloscope trace of the 100 
detector/CCD array. Referring to Figure 20, the straight horizontal part of the 
trace on the left-hand side defines the zero signal level. The pedestal seen in all 
of the other pixels in Figure 20 is due to in-plane scattered light. The pixel 
corresponding to the rf signal of 725 MHz has been highlighted. The adjacent cells 
that are 4 MHz away by design exhibit a signal half as strong as the pixel cor- 
responding to the peak. The 3 dB frequency resolution is therefore 8 MHz. The 
lower half of Figure 20 illustrates the display on a regular electronic spectrum 
analyzer when the same 725 MHz rf signal is incident on it. As a second example, 
the photograph in the upper part of Figure 21 illustrates the output from one half 
of the detector/CCD array when a 705 MHz, 100 mW rf signal is incident on the SAW 
transducer of the IOSA. Once again, the lower part is the display of a conventional 
spectrum analyzer with the same 705 MHz, 100 mW rf signal incident on it. 


Single-Tone Dynamic Range 

The dynamic range of response is an important parameter in the evaluation of 
the IOSA. As outlined earlier, in-plane scattered light appearing at the output 
of the LiNb 03 crystal results in a fixed pattern of noise impinging on the detector/ 
CCD array. The resulting pedestal of background light is clearly illustrated in 
Figure 20. Dynamic range measurements were carried out with respect to the fixed 
background pattern. Measurements were carried out without resorting to a lock-in 
type of phase sensitive detection. With the output of the detector/CCD chip dis- 
played on an oscilloscope, we read signals off the screen corresponding to each 
value of the rf drive signal. Also, to get an estimate of the cross-talk figure for 
our device, we measured the signal appearing in a pixel which was next to the nearest 
neighbor, corresponding to a frequency gap of 8 MHz. The result of one such 
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measurement is shown in Figure 22. The detector/CCD clocking was carried out at 
333 kHz , with the rf at 729 MHz and an I laser = 21 mA. The signal in the main 
pixel saturates for rf power greater than about 200 mW, resulting in' a dynamic 
range of 20 to 23 dB, with the minimum visually discernable signal level at about 
1 mW (rf). The receiver sensitivity for the I0SA as determined by observing the 
detector/CCD output on an oscilloscope would be 1 mW (of rf power) , The signal 
in the next to nearest neighbor (8 MHz array) appears to be a factor of 4 smaller 
than the main pixel. This measurement is again in agreement with our 3 dB resolution 
figure of 8 MHz (Figure 20) . 

The dynamic range is currently limited by saturation of the detector at the 
high signal end, and by the presence of the fixed pattern background signal at the 
lower end. Steps such as improved crystal handling, improved waveguide fabrication, 
and polishing of the crystal surface after waveguide formation can be pursued to 
reduce in-plane scattering of light. 


Two-Tone RF Resolution 


Functionally, the IOSA is expected to receive and analyze multiple rf signals 
for their frequency and amplitude characteristics. An important parameter in 
analyzing more than one rf component at a time is the minimum spacing between 
adjacent signals below which the device fails to resolve. The two-tone resolution 
measurement attempts to quantify the device performance in regard to simultaneous 
multiple signal spectral analysis. In this device design, the spacing between 
adjacent detector pixels corresponds to an rf increment of 4 MHz. If the achieved 
rf resolution were such that a significant response was observed in detector pixels 
adjacent to the intended pixel, two-tone resolution would suffer. Single-tone 
resolution measurements discussed earlier (see Figure 20) have shown that the 
strength of the signal in an adjacent detector pixel 4 MHz away is only a factor of 
2 smaller. Assuming a simple Gaussian spot shape impinging on a detector element, 
we would not expect the device to resolve a pair of adjacent rf signals of equal 
strength and 8 MHz separation. Yet our measurements with the device indicate that a 
pair of equal strength rf signals 8 MHz apart are resolved. Oscilloscope displays 
of the output of the detector/CCD chip, when the SAW transducer is driven by two rf 
signals of 100 mW intensity separated by 8 MHz, is illustrated in Figure 23. In 
Figure 23(a) the output of the 100 element detector/CCD chip is shown in proper time 
sequence. In Figure 23(b) the outputs from the two shift registers are displayed 
separately. The rf spacing between adjacent points in Figure 23(b) is 8 MHz, while 
the spacing between adjacent points in Figure 23(a) is 4 MHz. In Figure 23(c) and 
(d) we show time expanded details of the displays in parts (a) and (b) . The rf 
spectrum of the signal used to generate the output in Figure 23(a) is shown in Fig- 
ure 23(e). The central signal at 689 MHz is 20 dB smaller than its two neighbors 
at 685 MHz and 693 MHz, respectively. Hence, the full 200 mW of rf power is dis- 
tributed equally between the two signals at 685 MHz and 693 MHz. An examination of 
the display in Figure 23(a) shows that the pixel corresponding to 689 MHz exhibits 
a response 0.7 dB below the pixels at 685 MHz and 693 MHz. Thus, two equal strength 
signals separated by 8 MHz are clearly resolved. 

In order to characterize the two-tone resolution of the device, we carried out 
a number of other measurements. Similar displays were obtained when the spacings 
between the two 100 mW rf signals were 12 MHz, 16 MHz, 24 MHz, and 40 MHz, 
respectively. Resolution of the two rf signals in each case was very clear, as shown 
in Figure 24, with a frequency offset of 12 MHz. 
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An interesting feature of the two-tone measurement is concerned with the 
measurement of the dynamic range of a signal in the presence of a strong signal 
nearby. Referring to Figure 20, we find it necessary to move 20 MHz (5 or 6 
adjacent detector pixels) to achieve a cross figure of -20 dB. This means that in 
the presence of a strong signal, a weak signal would not be clearly resolved (with 
a cross-talk figure of -20 dB) unless it were > 20 MHz away. The two- tone dynamic 
range is expected (if Af > 20 MHz) to be better than 10 dB for the weak signal. 

In order to improve the two-tone dynamic range we would need to improve the fre- 
quency resolution. 


rf Pulse Measurements 

The I0SA device is expected to receive and analyze rf pulses of short time dura- 
tion. If a laser with a divergence angle of 11,45° is used with this optical sys- 
tem (with a focal length of 20 mm) , it generates a collimated optical beam 4 mm wide. 
Combining the 4 mm optical beam width with a surface wave velocity of 3.5 x 10^ cm/sec 
yields an acoustic aperture of 1,15 ysec. At any given instant in time the acousto- 
optic interaction can only analyze an acoustic wave train that is 1.15 ysec long. 
Because of the integrating nature of the detector /CCD chip, all light impinging on 
a given detector pixel during one integration period contributes to the signal 
packet. An acoustic aperture of 1.15 ysec corresponds to a frequency resolution of 
0,85 MHz, and there does not appear to be an advantage in integrating signals for 
periods longer than the acoustic aperture. For analyzing pulses, it would be 
advantageous to carry out the signal integration process for the duration of one 
acoustic aperture. As the integration time is reduced, one must increase the power 
output of the semiconductor laser by a proportionate amount in order to maintain the 
large signal handling capacity of the device. At the same time, laser power scaling 
has a definite upper limit due to the limited power output (10 to 15 mW) of single 
mode semiconductor lasers. 

With the present device we have operated the detector/CCD chip at a clocking 
frequency of 333 kHz, with an integration time of 192 ysec. With a laser power 
output of 1.5 mW and a total insertion loss of 17 dB from the laser facet to the 
output of the crystal, signal saturation appeared at rf drive power levels of 200 mW. 
We have carried out a set of calculations to arrive at different sets of operation 
alternatives for our IOSA. The results are summarized in Table 6. The last row 
in Table 6 indicates that with a 15 mW laser output and an integration time of 
0.96 ysec, signal saturation should occur at an rf power level of 1000 mW. These 
are all numbers that appear reachable at this time. 

To highlight some of the above issues, we present rf pulse measurement data in 
Figure 25 for the case when the integration time was 38 ysec (CCD clocking frequency 
of 1.67 MHz) and pulses of various duration were applied to the SAW transducer suc- 
cessively. The pulse widths were 38 ysec, 19 ysec, 15.2 ysec, 11,4 ysec, 7.6 ysec, 
and 3.8 ysec. The rf (710 MHz) power level was set at 100 mW under cw conditions, 
and the modulating signal's pulsewidth was varied to achieve the different duty 
factors. The signal level is reduced successively in direct proportion to the duty 
factor, as shown in the displays in Figure 25(a) through (f ) . For purposes of 
establishing the background level, we show the detector/CCD output in Figure 25(g) 
with no rf power applied to the transducer. Finally, in Figure 25(h) we have 
included the spectrum of the modulated signal (duty factor 0,2) as seen on a regular 
electronic spectrum analyzer. The displays in Figure 25(a) through (g) highlight 
the problems involved in analyzing rf pulses that are small fractions of the 
detector/CCD integration times. 
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CONCLUDING REMARKS 


The results of measurements made on an integrated optic rf spectrum analyzer 
are reported. The performance of the device acousto-optic bandwidth, single- tone 
rf resolution, two-tone rf resolution, single-tone dynamic range, two-tone dynamic 
range, and single-tone rf response are presented. The device parameters that con- 
trol device performance are analyzed. These results demonstrate the viability of 
the IOSA for real time spectrum analysis of pulsed and cw rf signals. 

Improvements of rf bandwidth resolution can be obtained by the use of larger 
collimated optical beams. This requires larger optical lens elements, and hence, 
larger crystals. 
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Table 1, Parameters of Coll imat ion and Fourier Transform Lenses. 


Focal length 

18 . 8 mm 

Diameter 

8 . 7 mm 

Depth at center 

1.1 mm 

Useful aperture 

7 . 4 mm 

f /no . 

2.54 


Table 2. Measured Attenuation Values for Titanium 
Diffusion LiNbO , obtained at 600 MHz 
and 1000 MHz. 


Sample Number 

Oxidation 
Time at 
650°C , hr 

Diffusion 
Time at 
950°C , hr 

Attenuation, 
dB/ ysec 

600 MHz 

N-90 

0 

5 

1.9 

N-91 

0 

8 

7.0 

N-92 

2 

5 

0.54 

N-95 

4 

6 

1.9 

Untreated 

0 

0 

0.43 

surface 




(theoretical) 
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Table 3, Measured Velocity Dispersion Coefficients 
for Ti:LiNbO~ Substrates. Data Was Taken Over 
the 400 to 800 MHz Range. 


Sample 

Oxidation 
Time at 
600°C, hr 

Diffusion 
Time at 
1000°C, hr 

dT “ T (If ) * 10 " 5 

-1 (' '''“'I 
or \W 

A 

4 

5 

-1.59 

B 

0 

5 

f 

00 

C 

4 

5 

-1.57 

D 

0 

5 

-1.59 

E 

4 

6 

-1.55 

F 

4 

6 

-1.45 

G 

4 

6 

-1.57 

H 

4 

6 

-1.75 


Table 4. Noise Characteristics of Surface-Channel 
and Buried-Channel CCDs. 


Operation 

Process 

Surface 

Channel 

Buried 

Channel 

Input 

Fat zero input 

178 

0 

Integration and 
transfer 

Fast interface 
states 

834 

0 


Bulk states 


90 


Dark current 

<100 

<100 

Output 

Reset amplifier 

147 

147 


M0SFET 

68 

68 

i 

Total noise 
electrons 
(rms) 

All 

873 

210 
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Table 5. Charge Transfer Characteristics. 


Carrier 

Mobility, 

cm^/V-sec 

Electrode 

Length, 

ym 

t th, 

sec 

t 

tr, 

sec 

L, 

sec 

f, 

MHz 

600 

10 

2.56xl0 _8 

9.67xl0 _9 

6.6x10 

7.5 

600 

7.5 

1.44xl0~ 8 

3.04x10 ^ 

3.35xl0 -8 

14.9 

600 

6 

9.23xl0“ 9 

1.29xl0 -9 

2.04x10 

24.5 

1000 

10 

1.54x10 8 

5.80xl0 -9 

3.97xl0 _8 

12.5 

1000 

7.5 

8 . 65xl0~ 9 

1.83xl0 _9 

2 .OlxlO -8 

24.8 

1000 

6 

5 . 54xl0~ 9 

7 . 76xl0 -1 ° 

1 . 23xl0 -8 

40.6 


Table 6. Scaling of Laser Power and CCD Clocking 
to Maintain Performance. 


Laser 
Power 
Output , 
mW 

Total 
In/Out 
Insertion 
Loss , dB 

Output 
Zeroth 
Order 
Power , 
yW 

RF 

Power 

at 

Saturation, 

mW 

Integration 
Time, ysec 

1-5 

17 

30 

200 

19.2 

15 

17 

300 

200 

19.2 

15 

11 

1200 

200 

OO 

15 

11 

1200 

, 

1000 

0.96 
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OPTICAL BEAM WIDTH X CELL SIZE X 10“^ MM 


Figure 3 


- Optical crosstalk in the adjacent cell of the detector array as a 
function of optical beam width and detector cell size. 


SCATTERING ANGLE IN AIR, DEG 



1.3 1.0 0.5 0 0.5 1.0 1.3 

IN-PLANE SCATTERING ANGLE, DEG 


Figure 4.- In-plane scattering measurement of Ti:LiNb0 waveguide. 









0.1 0.2 0.3 


FOCAL LENGTH INCREASE (LENS RADIUS = 1 UNIT) 

Figure 7 . - Focal length increase versus percentage of depth decrease for both 

tool wear and surface removal. 



2x10' 3 4x10" 3 6x 10 3 8 x 10 3 1 x 10 2 1.2x 10 2 1.4x 10 2 

GEOMETRICAL SPOT SIZE (LENS RADIUS * 1 UNIT) 

Figure 8.- Geometrical spot size versus percentage of depth decrease for both 
tool wear and surface removal. Two curves are provided for each one 
corresponding to 100% and 90% of the original useful aperture. 







Figure 9.- Schematic of a modified chirp transducer with a two-track dog-leg 

electrode geometry. 


CALCULATED RESPONSE 


MEASURED RESPONSE 



Figure 10.- Comparison between the calculated and measured responses of the 

600-MHz modified chirp transducer. 
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COUPLING 
PRISM 


DEFLECTED 
BEAM 


OPTICAL 

DETECTORS 



Mjndeflected 

BEAM 

MODIFIED 
I CHIRP IDT 


'CHIP-FREE 

POLISHED 

EDGE 


Figure 11.- Test configuration used in the measurement of the Bragg acousto- 
optic deflection coefficient. 



A 0 p = 0.83 pm TE MODE 
POWER LEVEL = lOOmW 
OdB = 1.6% 

BW 3dB = 420 MHz 

Figure 12.- Measured acousto-optic deflection efficiency for a TE-mode GaAlAs 

laser. 
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FREQUENCY, MHz 

Figure 13.- Comparison of the theoretical and measured acousto-optic deflection 
efficiency when the sign of the electrode tilt is reversed. 


p+ CHANNEL STOPS DELINEATING 
CELL BOUNDARIES - 

DETECTOR 
ICELL 


SiO 





EPI 

LAYER 



POLYSILICON 
BIAS GATE 

ION IMPLANTED 

BURIED 

CHANNEL 

ISOLATION 

LAYER 


Figure 14.- Cross section of buried p-channel MOS photodetector. 
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TRANSDUCER NO. 2 


NA: 0.3 


Figure 17.- Schematic of arrangement used to measure the acousto-optic response 

of the IOSA. 



FREQUENCY, MHz 



Figure 18.- Acousto-optic response measured on a device with a Hitachi HLP 1400 

CSP laser. 
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Figure 19. - Acousto-optic response and net rf power into SAW versus frequency. 
The clear-cut correspondence in the position of the two plots leads to the 
conclusion that the irregular structure in the acousto-optic response is due 
to rf matching irregularities. 




DETECTOR/CCD OUTPUT FOR A 725 MHz/ 100 mW 

RF SIGNAL APPLIED TO THE I OS A 

HORIZONTAL: 20 ^s/DIV 

VERTICAL: 50 mV/DIV 

3dB RESOLUTION: 8 MHz 



Figure 20.- Single-tone frequency resolution on the IOSA device. Sample 4. 
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(b) 



t 


705 MHz 


Figure 21.- Response from one half of the detector/CCD array when driven by a 
100 mW, 705 MHz rf signal. (Vert: 50 mV/div, Horiz: 20 ysec/div) . 
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DETECTOR/CCD OUTPUT SIGNAL, mV 



Figure 22. - Dynamic range and cross talk measured at a CCD clocking frequency 
333 kHz (Sample 4, HLP 2400 laser, pixell 11 from the left). 





VERTICAL SCALE: 50 mV/div FOR a), b), c) AND d). a) AND c) COMPOSITE 
100 ELEMENT DISPLAY. HORIZONTAL SCALE: 20 ftsec/div AND 10 ^sec/div. 
b) AND d) SEPARATE DISPLAY OF THE TWIN DETECTOR ARRAY CHANNELS. 
HORIZONTAL SCALE: 20 ^sec/div AND 10 ^sec/div 


Figure 23.- Two-tone response on Sample 4, AF=8 MHz, F =689 MHz, (P ) =200 mW 

HLP 2400 laser, I =21 mA.° rf tot 

laser 
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(c) 


VERTICAL SCALE: 50 mV/div FOR a), b), c) AND d). a) AND c) COMPOSITE 
100 ELEMENT DISPLAY. HORIZONTAL SCALE: 20 /isec/div AND 10 ^sec/div. 
b) AND d) SEPARATE DISPLAY OF THE TWIN DETECTOR ARRAY CHANNELS. 
HORIZONTAL SCALE: 20 /isec/div AND 10 ft sec/div 


Figure 24.- Two-tone response on Sample 4, AF=12 MHz, F^-689 MHz, (P^) to) _-200 mW, 

HLP 2400 laser, I laser =21 
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Figure 25.- RF pulse measurements. CCD clocking frequency 1.67 MHz. I =27 mA, 

laser 

Vert scale: 50 mV/div, Horz. 5 ysec/div. a) cw rf power 100 mW. Pulse duty 
factors: b) 0.5, c) 0.4, d) 0.3, e) 0.2, f) 0.1 and g) 0. h) display the rf 
spectrum of a pulse train with a duty factor 0.1. 
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APPLICATIONS OF ELECTRO-OPTIC GRATINGS IN 
INTEGRATED OPTICAL SIGNAL PROCESSING DEVICES* 

Carl M. Verber 

Battelle Columbus Laboratories 
505 King Avenue 
Columbus, Ohio 43201 


SUMMARY 


The electro-optic grating is an easily fabricated component which allows a rapid 
and efficient interaction with an optical wave in a planar electro-optic waveguide. 
The operation of such gratings and their use as intensity modulators, spatial light 
modulators, and components in correlators and in a variety of computational units is 
described . 


INTRODUCTION 


The electro-op tic effect has been the most important interaction mechanism used 
in the design of the active integrated optical devices employing channel waveguides. 

In these devices, the designer is able to take advantage of the lateral confinement 
of the light which allows long interaction lengths and consequently low voltages. In 
the present paper we will, in contrast, be concerned with the use of the electro-optic 
effect in planar integrated optical structures. In this planar geometry, the guided 
light is confined to a several micrometer-thick waveguide at the surface of a suit- 
able electro-optic substrate. Lateral diffraction is not inhibited, but the confine- 
ment close to the surface allows the use of moderate voltages to obtain a strong 
interaction with the guided light. In particular we will show how the interaction of 
a relatively broad guided beam with an electro-optically induced thick phase (Bragg) 
grating^ can be used to advantage in a variety of integrated optical devices. We 
begin by describing the properties of the gratings and then discuss a number of modu- 
lators, correlators and computational units which incorporate these gratings. 


THE ELECTRO-OPTIC GRATING 


The basic geometry of the 1.0. grating electrodes is shown in figure 1. The 
electrodes are defined by standard photolithographic techniques upon the surface''* of 
a planar electro-optic optical waveguide such as Ti-indif fused^ LiNb 03 . It is conven- 
tional to make the electrode line-wide equal the spacing between adjacent fingers, so 
the required photolighographic resolution is A/4. 


* Work supported in part by Air Force Office of Scientific Research. 

** Of'ten a sputtered glass buffer layer is employed^ to isolate the guided wave 
from perturbation due to the metallization pattern. 
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The application of a voltage across the electrodes results in a periodic elec- 
tric field which, via the electro-optic effect, gives rise to a periodic modulation 
of the index of refraction of the waveguide material and thus to the mode index of 
the guided wave. The guided wave sees this periodic perturbation as a thick phase 
grating and will be diffracted as indicated in figure 1 when incident upon the grat- 
ing at the Bragg angle, defined by 


Sin0 B = JR 

where A is the optical wavelength in the medium. 

5 


The diffraction efficiency is 

T] = 


sm 


TrAnd 


A cos0^ 
o B 


a) 


( 2 ) 


where An is the amplitude of the periodic index modulation. The magnitude of An is 
determined by the product of the applied field strength E, and the appropriate elec- 
tro-optic coefficient r. . according to^ 

ij 

An = 1 n 3 r . ,E (3) 

2 ij 


where n is the average index of refraction. Inclusion of geometric effects^ results 
in an index modulation 
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or a diffraction efficiency of 
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Ignoring buffer layer effects, eq. (5) indicates that for A = 8 ym, a He-Ne laser and 
d = 2 mm, we get 100% diffraction efficiency for V - 3.1 volts. Since high diffrac- 
tion efficiencies are readily achieved and the electrode capacitances are quite low, 
it is evident that the electro-optic Bragg effect can be utilized to make a high per- 
formance modulator. This device was originally suggested by Hammer and Phillips,^ 
and has more recently been employed by Holman^ to make a high performance planar 
modulation with a 69% optical throughput. 


THE INTEGRATED OPTICAL SPATIAL LIGHT MODULATOR 


The basic grating structure can be extended as shown in figure 2 by introducing 
electrodes which allow segments of the grating to be individually addressed. In this 
manner, one can impose a transverse amplitude modulation upon the diffracted beam. 

The undiffracted beam will of course have a complementary modulation. The grating 
structure is now operating as an electrically addressable integrated optical spatial 
light modulator (IOSLM) and can, in principle, be used to modulate an arbitrarily wide 
guided wave. The modulator can be used in an analog or a binary mode, although there 
will obviously be a finite number of addressable segments. The largest such IOSLM we 
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have fabricated thus far 9 is composed of 32 segments 200 ]im wide and spans a 6.4 nun- 
wide guided wave. 

It should be possible to reduce the width of each grating segment to 50 ym or 
less. It would then be possible to use the IOSLM in the signal and filter planes of 
a planar optical Fourier transform device. 10 However, even with the larger grating 
segment size, several useful functions can be performed by incorporating a SAW trans- 
ducer on the same substrate as the IOSLM. If the orientations of the EO and acoustic 
gratings are chosen so that only doubly diffracted light is detected, then by intro- 
ducing a single acoustic pulse which is shorter than a single IOSLM element, the 
transverse modulation produced by the IOSLM is converted to a temporal modulation of 
the output (doubly diffracted) beam. Parallel-to-serial conversion is thus accomp- 
lished. Another application involving the combined IOSLM SAW structure is the 
correlator, which is shown schematically in figure 3. Such a device has been con- 
structed^ and operates on a 32 bit word at a data-rate of 17.5 Mbit/sec. 


COMPUTATION WITH GRATINGS 


Consider once again the simple grating structure shown in figure 1. If voltage 
is applied to one electrode and voltage Vg to the other, the diffracted light in- 
tensity is given by 

I = I o Sin 2 [a(V A - V B )] (6) 


where d is a constant. For I < 0.1 I we have 

o 


1 - “ 2(V A - V 2 


(7) 


Now arrange N such electrode sets in a vertical line so that they can each be address- 
ed by the same broad optical beam. If the N components of the vector 5 are applied 
to one electrode of each set and the corresponding component of the vector B is 
applied to the opposed electrodes then the total deflected intensity is 


I 


E I. 

i=l 1 


Z 

i=l 


Ct 2 (v 
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( 8 ) 


We have therefore calculated the magnitude of the difference of the two vectors. 

Vector multiplication can also be performed as is suggested in figure 4. In 
this structure the diffracted intensity from the i^h segment is 


I . 

i 


1 ri A . ru . 
o Ai Bi 


(9) 


Of course, each p is actually proportional to the square of the voltage difference 
but it can be shown that by a simple signal processing technique, the desired ApBp 
term can be extracted from Ip. The structure shown in figure 4 is therefore capable 
of performing the scalar product of two vectors. 
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It should also be noted that if binary signals are applied to the grating struc- 
tures then standard logic operations may be performed. The structure used for sub- 
traction can be used to generate the EXCLUSIVE OR operation* or the NOT operation* 
while the structure used for multiplication can be used for the logical AND. These 
operations may also be carried out in parallel, and in the case of binary signals* 
the complications arising from the the sin^ response are no longer present. 

SUMMARY 


We have shown a variety of applications of electro-op tically induced Bragg grat- 
ings in integrated optical signal processing and computation devices. The gratings 
are easy to fabricate and operate efficiently on relatively low voltages. The design 
principles are well known and reliable. It is therefore quite likely that a large 
number of additional devices employing similar grating structures will be developed. 
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transducer 



Figure 3.- Correlator incorporating a programmable IOSLM and a SAW transducer with 
digital modulation. 



Figure 4.- Electrode structure for vector multiplication. 
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FOUR- AND FIVE-LAYER SILICON-CLAD 
DIELECTRIC WAVEGUIDES* 


Glen McWright and T. E. Batchman 
University of Virginia 
Charlottesville, Virginia 


ABSTRACT 


Computer modeling studies on four-layer silicon-clad planar dielectric wave- 
guides indicate that the attenuation (a) and mode index ($/K) behave as exponentially 
damped sinusoids as the silicon thickness is increased. The observed effect can be 
explained as a periodic coupling between the guided modes of the lossless structure 
and the lossy modes supported by the high-refractive index silicon. Furthermore, 
the attenuation and mode index are significantly altered by conductivity changes in 
the silicon. An amplitude modulator and phase modulator have been proposed using 
these results. Predicted high attenuations in the device may be reduced signifi- 
cantly with a silicon dioxide buffer layer. 


INTRODUCTION 


A need has arisen for direct optical modulation technology. This need has 
arisen from the search for faster digital switches, higher capacity data channels, 
and light, compact data preprocessing equipment for satellites. One promising 
technology that has been examined is the modulation of a guided light wave via 
photoconductivity changes in a semiconductor cladding. 

Computer modeling studies on four- and five-layer, silicon-clad, planar di- 
electric waveguides indicate that the propagation characteristics can be altered 
by changes in the complex permittivity of the silicon and in the thickness of the 
silicon. Using these predictions, an intensity modulator and a phase modulator 
based on photon- induced conductivity changes in the semiconductor cladding have 
been studied. 


SEMICONDUCTOR-CLAD WAVEGUIDES 


The four-layer planar waveguide structure under consideration is shown in 
Figure 1, where it is assumed that light is propagating in the dielectric (N^) 
and all materials are lossless except for the semiconductor (N^) • We desire to 
solve for the complex mode propagation constant (a + j 3) • 


^Research sponsored by NASA-Langley Research Center under Grant NSG-1567. 
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One technique (1) extends Maxwell’s equations and boundary conditions to 
numerically solve a transcendental equation relating the attenuation constant 
(a) and phase constant ( 3 ) to the material types and thicknesses of the wave- 
guide structure (hereafter refered to as PROGRAM WAVES). 

A more efficient method (2) utilizes a matrix representation of Maxwell’s 
equations, field solutions and boundary conditions in each waveguide layer (here- 
after refered to as PROGRAM MODEIG) . The matrices are multiplied and a character- 
istic matrix for the entire structure is obtained which yields the attenuation 
constant and phase constant. 

The waveguide consists of a semi-infinite glass substrate, a polystyrene core 
of thickness 1 micrometer, a silicon cladding of .01 micrometer to 10 micrometers 
in thickness, and a semi-infinite layer of air. Each material is characterized by 
a complex relative permittivity, £; a free space wavelength of 632.8 nanometers is 
assumed and material parameters are shown for this wavelength (Fig. 1) Layers 
N^, and N 4 are lossless dielectrics, so £ r is real; however, at optical frequencies 
the permittivity of the silicon (N 2 ) is complex (£ r = £ f r 4- j£ n r ), and the complex 
part is a linear function of the conductivity (e" 8=5 c/u)£ 0 ) . 

PREDICTED CHARACTERISTICS 


The curves presented in Figures 2 and 3 were generated by repeated use of our 
own PROGRAM WAVES and later confirmed with PROGRAM MODEIG. The silicon cladding 
was varied from .01 micrometer to 10 micrometers and the complex mode propagation 
constant was calculated. The expected result was that as the cladding thickness 
was reduced to zero, the attenuation decreases to zero in a well-behaved manner; 
however, the results were not well-behaved when the silicon thickness falls below 
1 micrometer. The curves are similar to exponentially damped sinusoids. Extreme 
3/K variations correspond to median values in the a-curve, and extreme a variations 
correspond to median (3/K values. By increasing the conductivity of the silicon 
cladding, the amplitude of the curve oscillations decreases slightly, and the a- 
curve shifts vertically to a higher attenuation. The percent change in attenuation 
compared to dark conditions (cr = 0 Q ) for different conductivities (a = 1.1 cr 0 , a = 
1.25 a Q , 0 = 1.5 0 0 ) is shown in Figure 4. The 3/K curves shift for a conductivity 
change, as well, although not in such a well-defined manner. The percent phase 
shift compared to dark conditions (o = a Q ) for different conductivity changes is 
shown in Figure 5. These effects will be used for intensity and phase modulation 
in a device where a signal source induces photoconductivity changes in a thin sili- 
con cladding on a waveguide, thus modulating a coherent beam in the guide. 


FIELD CALCULATIONS 


We now consider the problem of correlating the local maximum /minimum points 
on the attenuation and mode index curves with the electric and magnetic field 
distributions in the waveguide layers. Results indicate that the presence of a 
thin silicon film (< 98 a; has little effect on the wave function profiles; the 
profiles are similar to those of the three-layer lossless structure (air-dielectric- 
substrate). For the thick silicon film structure, however, the lowest order mode 
of the lossless three-layer structure couples to the modes associated with the 
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semiconductor film (the high-refractive index silicon behaves as a waveguide) . 
Furthermore, the coupling between the modes supported by the three-layer lossless 
structure and the high loss TE t modes* of the silicon waveguide determines the 
attenuation and phase of the complete four-layer structure. 

Our results can be described as periodic coupling between the guided mode and 
other leaky modes of the same guide. First, we examine the partial structure con- 
sisting of a silicon guiding region surrounded by semi-infinite layers of air and 
polystyrene. The attenuation and mode index are shown in Figures 6 and 7. We note 
a phase match condition between the modes of the partial structure (air, silicon 
guide, polystyrene) and the TEq mode of the complete waveguide at cutoff thicknesses** 
for successively higher order modes of the partial structure. The sharp peaks on 
the attenuation curve, for the four-layer structure, occur whenever the guided wave 
is strongly coupled into the high-loss modes of the silicon partial structure; 
conversely, the sharp nulls of apparently zero coupling efficiency occur at thick- 
nesses midway between the values for two adjacent leaky modes of the partial struc- 
ture. This is similar to the results of power transfer calculations for linearly 
tapered directional couplers (3, 4). The abrupt transitions on the mode index 
curve of the complete structure occurs when the phase match condition is satisfied 
and the guided wave couples into successively higher order modes of the partial 
structure. 

We now consider our results in terms of the electric and magnetic field distri- 
butions at the local maximum /minimum points on the attenuation vs. silicon thickness 
curve. The real part of the TEq mode electric field profile in the transverse direc- 
tion is shown for a cladding thickness, t 2 = .007 micrometers (Figure 8(a)). We recall 
that this thickness is below the cutoff value for the silicon waveguide structure; 
note that the wave function profile is not appreciably distorted. 

For the first local minimum (t 2 = . 05ym) , we observe that the field strength at 
the silicon-dielectric interface approaches zero. We also note the exponentially 
decaying solutions in the outer, semi-infinite layers as expected. For the first 
local maximum (t 2 = . 09]im) , we observe a sharp peak in the wave function profile at 
the silicon-dielectric interface as coupling to the TE ? j_ mode of the silicon guide 
occurs (Figures 8(b) and 8(c)). 

Similar behavior is noted for the next local minimum/maximum pair, Figures 9(a) 
and 9(b), (t^ = .13ym and t^ = . 18ym) . The field strength is effectively zero at the 
silicon-dielectric interface for the local minimum, and a sharp peak in the wave 
function is evident for the local maximum as coupling to the TE^ mode of the silicon 
guide occurs. We also note that the field begins to oscillate in the semiconductor 
cladding as we couple into the higher order modes of the silicon waveguide structure. 

Again, for the next local minimum/maximum pair (t 2 ” . 22ym and t 2 = . 26ym) the 
number of field oscillations in the silicon increases. (See figures 9(c) and 9(d).) 
The increase or reduction in field strength at the silicon-dielectric interface is 
also apparent. At this local maximum (t2 - . 26]im) , there is the sharp field peak 
at the interface, but the field decays rapidly through the dielectric indicating 
almost complete energy transfer. For the other maxima cases considered, the sharp 

* TE* denotes guided modes in the semiconductor and TE^ denotes guided modes in 
tne dielectric. 

** Cutoff for the silicon guide occurs when (3/K) c . < n _ 

& Si polystyrene 
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peak was evident at the interface, however, a sizable field was still present in the 
polystyrene dielectric. This indicates that the local maximum (and likely, minimum) 
values used for the calculations are not the precise values as in the former case 
(Figures 9(a) and 9(b)). 

The field plots indicate, then, that the attenuation and mode index of the 
four-layer structure may be explained quite simply as a coupling between the basic 
three-layer lossless waveguide (air-dielectric-substrate) modes and the high loss 
TE f modes of the silicon guide. For a local minimum on the attenuation-thickness 
curve, the field at the semiconductor interface is zero, and for a local maximum, 
a sizable field is set up at the interface which decays rapidly through the guide 
and substrate. Finally, the number of field oscillations in the silicon increases 
as we couple into the higher order modes of the partial structure. 


REDUCTION IN ATTENUATION THROUGH USE OF FIVE-LAYER 

STRUCTURE 


Thin dielectric buffer layers have been used to lower the attenuation losses 
of metal-clad dielectric waveguides (5) . These layers are placed between the di- 
electric core and the metal, and act as buffers to remove a large portion of the 
field from the metal claddings. We now consider the effect of an Si02 buffer layer 
on the attenuation vs. silicon thickness characteristics. 

The result for an Si02 buffer layer (e r = 2.12) of several different thick- 
nesses is shown in Figure 10. We note the familiar damped sinusoidal behavior and 
the corresponding reduction in attenuation. 

o 

The result for an Si02 buffer layer (tgj_Q 2 = 2000A) of several different per- 
mittivities is shown in Figure 11. Again we note the damped sinusoidal behavior 
and the corresponding reduction in attenuation. 

Our studies indicate, then, that the attenuation may be reduced significantly 
with an Si 02 buffer layer while still preserving the oscillatory behavior of the 
attenuation curve; more effective reduction is accomplished with a lower permittiv- 
ity buffer layer. Also, a buffer layer increases the 3/K values slightly but 
decreases the amplitude of the oscillations on the 3/K- thickness curve. 


CONCLUSIONS 


Computer modeling studies on four-layer silicon-clad dielectric waveguides 
indicate that the attenuation (a) and mode index (3/K) behave as exponentially 
damped sinusoids as the silicon thickness is increased. The observed effect can be 
explained quite simply as a periodic coupling between the guided modes of the loss- 
less structure and the lossy modes supported by the high-refractive index silicon. 
Furthermore, the attenuation and mode index are significantly altered by conducti- 
vity changes in the silicon; an amplitude modulator and an intensity modulator have 
been proposed using these results. Predicted high attenuations in the device may 
be reduced significantly with a silicon dioxide buffer layer between the semiconduc- 
tor and the polystyrene guide. 
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Experimental confirmation of the predicted characteristics is still necessary. 
A number of thin-silicon film waveguides have been RF sputtered but attenuation 
measurements to verify the damped oscillatory behavior are forthcoming. Conduc- 
tivity variations of the silicon should demonstrate the modulation capabilities. 
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SUMMARY 


Surface layers capable of supporting optical modes at 10.6 microns have been 
produced in n-type GaAs wafers through 300 keV proton implantation. The dominant 
mechanism for this effect appears to be free carrier compensation. Characteriza- 
tion of the implanted layers by analysis of infrared reflectivity spectra and 
synchronous coupling at 10.6 microns produce results in good agreement with elemen- 
tary models . 


INTRODUCTION 


Integrated optical circuits have vast potential for use as sensing and signal 
processing elements in future Army fire control systems. One of the requirements 
for these systems is 24 hour/all weather operation. In order to meet this require- 
ment, the Army has been emphasizing systems which operate at longer wavelengths, in 
particular, at the 10.6 micron CO laser line. Such systems, including CO^ laser 
radars, miss-distance sensors, ana sensors for smart munitions, form a natural area 
of application for infrared integrated optical circuits. These circuits could be 
used to perform various functions including phased array transmission, heterodyne 
detection, parallel processing and optical correlation. 

Methods which have been used to fabricate optical waveguides and components 
include thin film deposition, epitaxial growth, ion diffusion, and ion implantation, 
(ref. 1) The latter method is a particularly interesting technique that can pro- 
duce electronic, chemical, and optical changes. In crystalline semiconductor 
materials, ion implantation can alter the refractive index through various physical 
mechanisms such as damage, doping, stress, and free carrier compensation. However, 
it is possible, by careful design of the implantation conditions to accentuate one 
of these mechanisms and to minimize the effects of the others. The present exper- 
iments are centered on free carrier compensation of highly doped n-type GaAs through 
medium energy proton implantation. Although this technique has been used to produce 
optical waveguides for use in the near infrared (ref. 2), little work has been done 
to investigate this effect for use at 10.6 microns. 

This phenomenon can be illustrated by a simple dielectric model in which the 
dielectric function of GaAs is composed of Lorentz oscillator terms representing 
the lattice contribution (ref. 3) and a Drude term representing the depression of 
the dielectric function due to the presence of free carriers, (ref. 4) In the 
spectral region between the band gap and the IR phonon resonance, the dispersion 
due to the lattice contribution is much less than that due to the plasma for heavily 
doped material. Thus, to good approximation, the dielectric function for these 
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materials may be written, in terms of wavenumber a, as 


n 2 (a) = K T - 


a(a + ig) 


269 cm 1 « a 


« 2xl0 4 


-1 

cm 


a) 


where is the high frequency dielectric constant of GaAs , taken to be 10.7. a 2 is 
related to the carrier concentration N through 



Ne 2 


(2 ire) 2 m*e 

o 


( 2 ) 


where e is the electronic charge, m* = ,08m is the effective mass 
of electrons in GaAs, and is the vacuum permittivity. The damping 

factor, g, leads to losses and is related to the mobility, y, of the material 
through 


e 

g = 

27Tcym* 


( 3 ) 


where c is the speed of light. During ion implantation, electronic trapping 
centers are created in the lattice which compensate the free carriers, effectively 
raising the refractive index. Neglecting losses, the increase in the refractive 
index is approximately 


An 


p,s 


Cl-N f /N g ) 

2na 2 


(4) 


9 o 

where Gp,s is Gp evaluated using the original carrier concentration N g before 
implantation, and Nf is the carrier concentration remaining in the layer after 
implantation . 


Polished (100) wafers of silicon doped GaAs were used in these experiments. The 
free carrier concentration was approximately 4xl0^/cm^. These wafers were implanted 
at room temperature with 300 keV protons to fluence levels from 10^ to 10^ /cm^. 

The implantation was performed at the Naval Research Laboratory. According to the 
LSS projected range calculations, the compensated region is expected to be sharply 
defined at a depth of approximately 3 microns, (ref. 5) Such a compensation profile 
would be unsuitable for optical waveguiding. However, it is experimentally observed 
that the implantation produces a damage layer with more uniform optical properties 
extending from near the surface to the projected range. 


INFRARED REFLECTIVITY 


Prior to waveguiding experiments the optical properties of the implanted sam- 
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pies were characterized by infrared reflectivity measurements. Differential reflec- 
tivity measurements, made over the spectral range of 4000 cm~l to 800 cm“l, yielded 
interference fringes typical of a thin f ilm/substrate structure, as illustrated in 
figure 1. The interesting features of these curves are the periodicity of the 
fringes and their increase in amplitude as the measurement progresses to shorter 
wavenumbers. A first-order model has been developed to aid in deducing the compen- 
sation and the thickness of the implanted layer from the reflectivity spectrum. 
Basically, losses are neglected and the implanted layer is considered to be a 
film of refractive index nf sitting atop a substrate whose index has been de- 
pressed a small amount An by its higher concentration of free carriers. The refer- 
ence sample is considered identical to the substrate. Expanding the relevant 
reflectivity equations in a McLaurin series expansion to first order in An and 
using equation (4) yields 




r2 

sample 

4a 2 n-N f /N s ) 
1 + P,S 

I sin(27rtn^a)\ 

r2 

reference 

n f (n 2 -l) 

\ ° ) 


(5) 


From this analysis, it can be shown that the thickness of the implanted layer can be 
determined by the separation S of the fringe minima, and the compensation X by the 
fringe amplitudes : 

n f (nf-l)ff 2 

t = <2Sn f )-l ; X = 1 - N f /N a . (6) 

S,s 

The carrier concentration of the substrate may also be determined optically by 
measuring the reflectivity of an unimplanted sample and noting the wavenumber at 
which the reflectivity dips to a minimum at the edge of the plasma resonance, (ref. 
6) At this wavenumber, the dielectric function of the film is very nearly 1. Again 
neglecting losses, the equation expressing this condition is easily solved for the 
carrier concentration: 


N = a£(K L -l)t2TTc) 2 e 0 m*/e 2 (7) 

The more heavily doped the sample is, the more the minimum moves to longer wave- 
numbers. This technique is sensitive for doping levels on the order of 10^^/cm^. 

For example, when N=2xl0^^/cm^ , a m =480 cm~^ ;whereas for N=4xl0^-®/ cm^ , G m = 679 cm“^- . 
This difference in wavenumber is easily resolved by a spectrophotometer. 

The measurements described above were made on a Perkin-Elmer Model 180 double 
beam infrared spectrophotometer fitted with dual specular reflectance attachments. 

As shown in table 1, the compensated layers are roughly 3 microns thick. Although 
the LSS theory predicts about 1 micron of penetration for each 100 keV of acceler- 
ator energy, some dose dependence is observed. The low values of the standard 
deviation, Gt> are due to the periodicity of the fringes, which in turn indicates 
the ion-implanted layers can be modeled as dielectric slabs. The determination of 
the compensation from the reflectivity data is less straight forward. In general, 
the fringe amplitudes build up at a different rate than can be accounted for by the 
dielectric model, even in its exact formalism. The values reported represent the 
average value obtained. The fractional compensation was about 41% for the 10 ^/cm^ 
implant and began to show saturation with a dose of lO^/cm^. Saturation is clearly 
evident in the samples implanted to 10^ and lO^/cm^. The standard deviations, a x , 
are larger in this case, due to the larger uncertainties in the reflectivities and 
the anomaly in the rate of fringe growth. The last two columns of table 1 show the 


233 



calculated carrier concentration of the films and the refractive indices at 10.6 

1 A 0 

microns, respectively. The refractive index of the sample implanted to 10 °/cm z is 
very close to the literature value of 3.275 for undoped GaAs at 10.6 microns, (ref. 7) 

INFRARED OPTICAL WAVEGUIDING 

Since the reflectivity spectra are interpretable in terms of a single dielectric 
f ilm/substrate structure, the equations for a planar asymmetric dielectric waveguide 
were used as the waveguide model: (ref. 8) 

kt/ n ^- (g/k) 2 - = imr (8) 

where k.~2Tx/X^, g/k-n^sinO^ and the phase angles are defined by: 

tan = (n /n . ) 2 tan $■?? (9) 

fx f i fi 

The subscript i refers to either the cover layer o or to the substrate s. Equation 

(8) is the familiar eigenvalue condition which must be satisfied for optical wave- 

guiding to occur and is expressed in terms of the ratio g/k. This factor can be 

interpreted as the effective refractive index of the waveguide for each mode and is 

restricted to values between n and n-. 

s f 

Infrared optical waveguiding was achieved in these samples using a 1 watt CO 2 
laser operating at 10.6 microns. The radiation was focused through an f-10 ZnSe 
lens on a dove shaped germanium prism which effected coupling into and out of the 
waveguide. The emerging beam, totally internally reflected from the base of the 
prism, was imaged as a spot on a thermographic screen. Waveguiding was identified 
by an absorption notch in the reflected spot, indicating that radiation was removed 
from the incident beam. Measurements of the angles at which synchronous coupling 
was achieved were made for both s and p polarizations of the incident beam. From 
these angles, the effective indices of the guided modes were determined. As two 
modes were found, it was possible to numerically invert the mode equations to obtain 
the refractive index- and thickness of the implanted layer. 

The results of preliminary waveguiding measurements at 10.6 microns are shown 
in table 2. Due to experimental limitations discussed below, synchronous coupling 
was only achieved in the two more heavily doped samples, and in those only the 
0-order modes were observed although it is believed those samples will also support 
the TE^ mode. 

The thickness obtained for sample 0519-7 is considerably larger than that 
obtained by IR reflectivity, although the values obtained for the refractive 
index compare well. With sample 0519-4 the agreement is better. The thickness 
obtained from the waveguide measurement is closer to that obtained by IR reflectivity 
while the values obtained for the refractive index are identical. 

Figures 2 and 3 present these results in a different format. These are com- 
puter generated plots of the predicted mode structure for a planar asymmetric 
dielectric waveguide using as inputs for n^ and n values obtained from the IR 
reflectivity spectra. The TE modes are indicated by the solid curves, while the TM 
modes are represented by the dashed ones. 


tan $ 


TE 

fi 


'(e/k) 2 -np 

\n^-C3/k) ' 
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A horizontal line has been drawn on each figure at the thickness determined by 
the IR fringe analysis. The effective indices of the modes supported by each sample 
would be expected to be at the intersection of the mode curves with these lines, as 
indicated by the A symbols on the figures. The corresponding points obtained by the 
synchronous coupling measurements are shown by the + symbols. 

For sample 0519-4 the measured modes are both obtained at larger values of g/k 
than expected and their separation is smaller, implying they intersect the mode 
curves at a larger value of thickness. That the waveguide data points remained on 
the mode curves indicates that the values of the refractive index obtained for the 
film by both methods are in excellent agreement. 

Another point to be made from figure 2 is that regardless of which value is 
used for the guide thickness, the TE-^ mode should be supported by this sample. The 
germanium prism used in these experiments had an angle of 54 degrees which limited 
the lowest value of g/k obtainable to about 2.7. Another prism is being prepared so 

the entire range of g/k will be accessible. If the TE^ mode is observed, the extra 

data point will provide more information about the actual refractive index profile. 

The last feature to be noted from this figure is that the mode curves are 
bounded, in g/k, from n g to np. The refractive index of the substrate for all the 
samples is 2.44 and is determined by the initial doping level of the GaAs . The 
refractive index of the film is determined by the carrier concentration remaining 

after implantation. It must therefore lie somewhere between 2.44 and 3.275, the 

value for undoped GaAs. Thus as the implant dose is decreased, the mode curves are 
all squeezed to the left as the upper limit on g/k decreases. 

Figure 3 presents the same data for sample 0519-7. The spacing between the 
TE q and TM modes indicates the layer is considerably thicker than predicted by 
IR reflectivity. That the mode curves are displaced slightly to the left of the 
theoretical curves illustrates the lower value of rif obtained by the synchronous 
coupling measurements. This sample should also support the TE^ mode and possibly 
the TM^ mode. 


DISCUSSION 

Several conclusions can be deduced from these results. The first of these is 
that proton implantation done at the moderate energy of 300 keV produces a damage 
layer which will support optical waveguiding at 10.6 microns. Although these layers 
are very thin relative to the wavelength used, the index change produced is so large 
that the waveguiding condition is easily satisfied. This layer also appears to have 
very uniform optical properties since empirical results are explainable in terms of 
elementary models. 

There is also very good agreement between the IR reflectivity results and the 
coupling measurements, indicating the experimentally easier reflectivity analysis 
has considerable predictive utility. Although it is not fully apparent from the 
data presented here, it has been found that these are probably complimentary 
techniques: IR reflectivity appears to be more sensitive to the thickness of the 

layer than to the refractive index while the coupling measurements yield greater 
uncertainty in the thickness of the layer and very small standard deviations in the 
refractive index. It is expected that the refractive index profile of ion-implanted 
waveguides will be shown to have a transition region between the guiding layer and 
the substrate rather than a sharp discontinuity as assumed in the models presented 
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here. The complimentarity of the measurement techniques, as well as the systematic 
differences in the experimental results, may be explained by the different way light 
experiences the transition region in each of these techniques. 

Finally it is noted that the effective indices of the guided modes are strongly 
dependent on dose, which is easily controlled. This suggests that structures re- 
quiring different velocities of propagation, such as delay lines, can be fabricated 
simultaneously on the same substrate through proper control of the ion beam. 

The measurements reported here were made on as-implanted samples. While 
current work is directed at completing these coupling experiments, the question of 
losses and post implantation processing to minimize them will be addressed. 
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TABLE 1. -CHARACTERIZATION OF SAMPLES BY INFRARED REFLECTIVITY 
AS-IMPLANTED: 300 keV H + @ ROOM TEMPERATURE 


SAMPLE 

DOSE 
1/ cm^ 

t 

ym 

a 

t 

ym 

X 

a x 

N, 

1/ ctn^ 

n f 

(@10.6y) 

0519-1 

10 13 

2.64 

.06 

.41 

.10 

2.2 x 10 18 

2.79 

0519-5 

10 14 

3.05 

.07 

.57 

.22 

1.8 x 10 18 

2.94 

0519-7 

10 15 

3.35 

.02 

.93 

. 12 

2.8 x 10 17 

3.22 

0519-4 

10 16 

3.31 

.01 

.98 

.19 

6.8 x 10 16 

3.26 

SUBSTRATE 






3.8 x 10 18 

2.44 


TABLE 2 . -CHARACTERIZATION 

OF SAMPLES BY 

co 2 laser WAVEGUIDING 

AS-IMPLANTED : 

300 

keV H + @ ROOM 

TEMPERATURE 


SAMPLE DOSE 

MODE 

(3/k 

t 

n f 

1/ cm^ 



ym 

1 


TE 

3.071 



0519-7 10 15 

0 


4.54 

3.20 


TM 

0 

3.026 




0519-4 

10 16 

TE 

o 

3.087 

3.71 

3.26 



TM 

0 

3.014 
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THICKNESS (um) 


SAMPLE: 0519-4 


EFFECTIVE INDEX, g/k 

Figure 2.- Comparison of waveguiding and reflectivity results for sample GaAs 
0519-4(10 H /cm ). Curves and points indicated by the symbol A are 
predictions based on IR fringe analysis. Results of waveguiding experiments 
are indicated by the + symbol. 


SAMPLE: 0519-7 



EFFECTIVE INDEX, g/k 

Figure 3.- Comparison of waveguiding and reflectivity results for sample GaAs 
15+2 

0519-7(10 H /cm ) . Curves and points indicated by the symbol A are 
predictions based on IR fringe analysis. Results of waveguiding experiments 
are indicated by the + symbol. 
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AlGaAs / GaAs TRANSVERSE JUNCTION STRIPE LASERS WITH DISTRIBUTED FEEDBACK 


M.J. Hafich, R.A. Skogman, P.E. Petersen 
Honeywell Corporate Technology Center 
Bloomington, Minnesota 
and 

H. Kawanishi 

Kogakuin University, Japan 


ABSTRACT 

Two AlGaAs transverse junction stripe lasers, one with distributed feedback 
and one with a distributed Bragg reflector, have been fabricated. Stabilized single 
longitudinal mode operation was obtained from both laser structures; temperature 
dependence of the lasing wavelength was 0.5&/°K for the TJS/DBR laser and 0.63X/SK 
for the TJS/DFB laser. 


INTRODUCTION 

The transverse junction stripe (TJS) laser in the AlGaAs/GaAs material system 
combined with a periodic feedback structure such as a distributed Bragg reflector 
(DBR) has the potential of being a stable, single mode optical source for GaAs 
integrated opto-electronics and optical communications. Periodic feedback eliminates 
the longitudinal mode hopping associated with conventional TJS lasers and provides 
control of the laser oscillation wavelength with very little device to device vari- 
ation. The TJS laser configuration (see Figure 1.) has both optical and electrical 
advantages: very low threshold currents (as low as 15mA has been reported for 

faceted TJS lasers^") and single transverse mode optical fields stable over a wide 
range of currents and temperature. ^ > 3 

TJS lasers with periodic feedback have been fabricated in two geometries (see 
Figure 2.)^. An interferometric and wet chemical etching technique was used to create 
a feedback grating across the entire pumping region for the distributed feedback (DFB) 
TJS laser and to create the separate distributed Bragg reflectors for the TJS/DBR 
laser. The TJS/DFB laser was a double heterostructure device grown by liquid phase 
epitaxy (LPE) and had a third order grating etched in the top Al^ 2 ^ a Q g^ s ^ a ^ er * 

The grating was buried by growing an AIq 35^0 65 AS layer on the grating by metal 
organic chemical vapor deposition (MO-CVD) . The TJS/DBR laser was also fabricated in 
an LPE double heterostructure. The top AlGaAs layer was thinned to O.lym over more 
than half of the laser so that the grating would be close to the GaAs active layer 
and optical field. Single mode operation in both configurations was obtained. The 
thermal shift of the laser wavelength in both cases was less than 1A/°K, compared to 
the 3A/°K shift of the spontaneous emission peak. 
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DEVICE PROCESSING 


The processing for the TJS/DEB and the TJS/DBR combines conventional AlGaAs/GaAs 
processing techniques with several special processes. The major steps are shown in 
Figure 3. For both types of lasers the gratings were in the AlGaAs layer directly 
above and within O.lym of the active layer. This avoids introducing nonradiative 
recombination of the injected carriers. The A1 20 Ga .80 As l a Y er °f the TJS/DFB laser 
provides some optical confinement without significantly adding leakage currents 
around the active GaAs junction. A final layer of AIq 35 GaQ 55 AS was grown by 
MO-CVD to increase the optical field interaction for optical feedback. The GaAs 
homojunction was formed by a two step Zn diffusion and ohmic contacts were evaporated 
on both sides of the laser crystal. Individual devices were cleaved from the wafer 
with Fabry Perot cavity lengths on the order of 500ym. The inclusion of facets for 
the Fabry Perot cavity simplifies testing since the structure would oscillate even 
at a temperature where there was not sufficient gain for the distributed feedback 
mode to oscillate. 


EPITAXIAL GROWTH 

The TJS/DBR double heterostructure was grown entirely by LPE . The three layer 
structure consisted of a ‘4pm, u-A 1 q ^ 35Gsq # 55 AS layer; a 0.4ym, n-GaAs active layer; 
and a 7ym, n-Alg 35^0 65^ s to P con fi n i n g layer. The n-GaAs layer was Te-doped to a 
carrier concentrat ion of approximately 2xl0^°cm~^. The structure of the TJS/DFB laser 
was similar except that the top confining layer of the double heterostructure was 
formed using a hybrid LPE/MO-CVD process. First, an undoped, 0.2ym, u-AIq # 20^ a 0 80 As 
layer was grown during the LPE process. After the DFB grating was etched into this 
layer, an n-Al Q ^GUq q 5 As la Y er was grown on the grating surface by the MO-CVD tech- 
nique. Using MO-CVD insured the successful regrowth of AlGaAs that could not be 
obtained with LPE. 


GRATING FABRICATION 

The periodic structures for the TJS/DFB and the TJS/DBR lasers were formed by 
etching corrugations into the top confining layers of the double heterostructure using 
an interf erometrically generated photoresist pattern as the etchant mask (Fig. 4). A 
HeCd laser (A=442nm) was used as a light source for the interferometer. Third order 
gratings with periods between 350 and 400 nm were produced. Very thin photoresist, 
^O.lym, was used to obtain the resolution needed to form the grating. Special care 
was taken to align the photoresist gratings parallel to the (Oil) direction of the 
laser crystal. After a photoresist grating had been formed on the AlGaAs surface, 
a preferential etchant, 3:1:1 9 was use< ^ t0 the grating into the 

confining layer. In the TJS/DBR laser, the pumping region of the laser was protected 
from the grating etch by a previously applied stripe of photoresist. In both con- 
figurations, the process goal was to place the gratings as close as possible to the 
GaAs active layer without diminishing the gain of the active region by mechanical 
damage. For the TJS/DFB laser, the photoresist was cleaned after the grating fabric- 
ation and a layer of AIq ^ 35 Gag , 55 AS was grown by MO-CVD on the grating surface. The 
next step in the TJS/DBR process was to sputter Si^N^ on the cleaned wafer for a Zn 
diffusion mask. 
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ZINC DIFFUSION 


The Zn diffusion that forms the p+_p region of the TJS laser diode was per- 
formed in a semi-open ampoule in a two step process. 170nm of low oxygen content 
silicon nitride was sputtered over the entire wafer and a window for Zn diffusion 
was opened by Freon plasma etching. The initial p+ diffusion was made at 700°C in a 
flowing atmosphere with a ZnAs source. Diffusion times up to two hours were re- 
quired to have the p+ region penetrate to a point just above the GaAs active layer. 
To form the p-region, the sample was annealed at 900°C in flowing hydrogen with the 
epilayers lying face down against the quartz boat to minimize material loss. 

Figure 5 is a scanning electron microphotograph showing the Zn diffusion profile 
through the double heterostructure. Because the energy bandgap of the lightly doped 
n-GaAs region was slightly wider than that of the heavily doped p-GaAs active region, 
the low doped n-GaAs layer in the grating region of the TJS/DBR laser acts as a low 
loss optical passive waveguide for laser light emitted in the heavily doped p-region. 

The final step of the fabrication process was to make ohmic contacts to the n 
and p regions of the lasers. Evaporated AuGe and Cr/AuZn were used as the n and 
p ohmic contacts. Both contacts were annealed simultaneously at 460°C. Each wafer 
was cleaved into separate devices with lengths of 300 to 600 pm* For heat sinking 
during testing, the lasers were mounted with In solder p-side down on the Au plated 
copper heat sinks. 


RESULTS 

A scanning electron micrograph of a finished TJS/DBR laser without ohmic contacts 
is shown in Figure 6. Figure 7 shows the lasing spectra of a TJS/DBR laser for heat 

sink temperatures from 133°K to 153°K. In this sample, approximately 9% A1 was in- 

cluded in the active layer making the lasing wavelength shorter than that for GaAs. 
The laser was pumped by applying current pulses with a duration of 200ns and repeti- 
tion rate of 3Kbits. The laser had a 140pm pumping region with a 220pm long distrib- 
uted Bragg reflector on one end and a cleaved facet on the other end of the cavity 
(See Figure 6). Lasing output was measured at the cleaved facet. At 133°K, the 
wavelength for the DBR mode was longer than the peak wavelength of the spontaneous 
emission (a few f.p. modes appear) by about 28^, and the intensity of this mode was 

not maximum because mismatching occurred between the DBR mode and the spontaneous 

peak. At 147°K, the matching was obtained and the intensity of the DBR mode in- 
creased substantially. The threshold current was 1.3 A at 147°K. Mismatch between 
the two wavelengths began to appear again as the heat sink temperature was further 
increased, and the DBR mode wavelength became shorter than that of the spontaneous 
peak at 153°K. In this temperature range, the DBR mode shifts continuously from 
8008^ without longitudinal mode hopping. 

The pulsed lasing spectra of a TJS/DFB laser (cavity length of 550pm) for heat 
sink temperatures varying from 289°K to 307°K is shown in Figure 8. The experimental 
procedure was similar to that used for the TJS/DBR laser. The thermal behavior of 
the DFB mode was very similar to that of the DBR mode. The DFB mode was coincidental 
with the peak of the spontaneous emission at 299°K. The DFB mode oscillated at room 
temperatures because of an accurate calculation of the equivalent refractive index 
of the active waveguide, and the grating period. 

Figure 9 shows the light output vs. injection current (I-L) characteristics as 
a function of the heat sink temperature for the TJS/DFB laser. At 289°K, where there 
was a large mismatch between the DFB-mode and the spontaneous emission peak, the I-L 
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characteristic was soft due to the relatively large amount of spontaneous emission. 
As the temperature was increased to 299°K, the intensity of the spontaneous emission 
decreased and a much sharper I-L characteristic was obtained. At 307°K, the I-L 
curve was similar to that of 299 K; however the light output began to saturate with 
the increase of injection current due to the heating of the active region. This 
data shows that the differential quantum efficiency was nearly independent of the 
heat sink temperature. 

Temperature changes have two effects on the optical output of TJS/DFB and 
TJS/DBR lasers. First, the peak of the spontaneous emission shifts with temperature 
causing the amplitude of the DBR or DFB mode to vary as was shown in the data of 
Figures 7 and 8. The shift in the spontaneous emission peak was measured to be 
3A/°K which was in close agreement with the shift of the energy gap in GaAs . 
Secondly, the DFB and DBR modes shift with temperature. The DBR thermal mode shift 
was measured to be 0.5A/°K, and the DFB thermal mode shift was measured to be 
0.63A/°K. The stability of the modes of the. periodic feedback lasers depends largely 
on the laser waveguide's refractive index variation with temperature. Calculations 
of the difference in the waveguide effective refractive index at different temper- 
atures were in close agreement with the experimental data, assuming an index change 
of 4 x 10 -4 /°C in a simple slab waveguide produces a thermal temperature mode shift of 
1A/°C. Because of the dependence of the refractive index the thermal shifting of 
the DFB or DBR modes cannot be totally eliminated; however, unlike the mode hopping 
of the Fabry Perot cavity, the DFB or DBR mode shift was small and continuous. 


CONCLUSION 

Two new laser structures have been fabricated: an AlGaAs/GaAs TJS/DFB laser 
and an AlGaAs/GaAs TJS/DBR laser. Both devices operated in a single longitudinal 
mode; maximum power was obtained at 147°K (threshold current 1 . 3A) from the TJS/DBR 
and at 298°K (threshold current 0.78A) for the TJS/DFB laser. No mode hopping was 
observed for the temperature ranges used. The temperature dependence of the lasing 
wavelengths was 0 . 5A/ °K and 0.6A/°K for the TJS/DBR and the TJS/DFB lasers, respect- 
ively . 
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Figure 1.- Schematic of a TJS laser. 
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Figure 2.- Structure of the TJS/DFB and TJS/DBR laser diodes* 
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CLEAVE OUTPUT FACETS. 




Figure 3.- Major processing steps for TJS/DFB and TJS/DBR laser diodes. 
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Figure 4.- Grating fabrication process. 


247 



AIGaAs 


GaA 
ACTIVE 
LAYER 



P REGION 


GaAs 

SUBSTRATE 






6,000X 








n REGION 


Figure Scanning electron micrograph of the Zn diffusion in 

an AlGaAs/GaAs double heterostructure 0 
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Figure 6.= Scanning election micrograph and schematic of the 
completed TJS/DBR laser. 






Figure 7.- Mode spectra of the AlGaAs/GaAs TJS/DBR laser 
between 133°K and 157°K. 
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Figure 8.- Mode spectra of the AlGaAs/GaAs TJS/DFB laser 
near room temperature. 
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LIGHT INTENSITY (ARB. UNITS) 


Figure 9 



- Light output as a function of injection current (I-L) 
characteristics of the TJS/DFB laser. 



EVAPORATED As 2 S 3 LUNEBURG LENSES FOR LiNb0 3 :Ti OPTICAL WAVEGUIDES* 
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SUMMARY 


Because of their relative ease of fabrication, advanced state of design, and low 
aberrations, Luneburg lenses are attractive candidates as elements in integrated- 
optics devices, both for simple beam-forming applications and for information pro- 
cessing. Few materials, however, possess the combination of high refractive index, 
low absorbance, and ease of deposition required for making Luneburg lenses suitable 
for use on high-index optical waveguides, such as those formed by diffusion of Ti 
into LiNb 03 * We have found that Luneburg lenses of good quality can be formed on 
such waveguides by evaporation of arsenic trisulfide glass through simple masks. 

Using only two thin circular-aperture masks, we could readily obtain lenses with 
focal spots of a few times the diffraction-limited width at f/4. These lenses have 
been designed for and tested at both visible (633 nm) and infrared wavelengths. 

Losses are generally small in both wavelength regions, although much of the guided 
light is coupled into the lens region. There exist restrictions on the steepness of 
the lens profile near the edge if partial coupling to local modes in the lens region 
is to be avoided. Since an irreversible change in refractive index can be effected 
in the AS 2 S 3 glass by exposure to blue or ultraviolet light, the focal length of the 
lens can be shortened optically after fabrication. This technique permits in-situ 
optimization of lens characteristics with no apparent degradation in focal-spot qual- 
ity. Procedures for the design, fabrication, and testing of lenses of this type will 
be described. 


INTRODUCTION 


Realization of integrated optics devices requires development of integrated ana- 
logs to the components familiar in conventional optics. Among the most important of 
such components is the lens. Integrated lenses can be useful not only for the usual 
focusing and beam-forming operations, but also as data-processing elements exploiting 
the Fourier-transform property of the lens. We have recently developed techniques 
for the design, fabrication, and evaluation of one type of integrated lens — the 
Luneburg lens — on planar optical waveguides formed by diffusion of titanium into 
LiNbO^. The lenses are formed of overlayers of AS 2 S 3 glass deposited onto the 
LiNb 03 surface by thermal evaporation through simple masks. Few materials other than 
chalcogenide glasses like As 2 S 3 possess the high refractive index, low absorbance and 
ease of deposition required for making this type of lens on a high-index material 
like LiNbC^. the Present paper we describe and exemplify our current procedures 
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for making AS 2 S 3 Luneburg lenses. We also describe how the well-known short- 
wavelength photosensitivity of this material can be used to adjust focal length 
after lens fabrication. 


BACKGROUND 


Three principal classes of lenses have been investigated for use in integrated 
optics . 

Geodesic waveguide lenses , figure la, are formed by creating a spherical depres- 
sion in the waveguide substrate and then fabricating the waveguide by diffusion or 
film deposition (ref. 1). A ray of light encountering the depression enters it in a 
direction determined by the perpendicular distance of the ray from the center of the 
depression. It then proceeds along a geodesic curve until it encounters the edge of 
the depression again, whereupon it reenters the planar region of the guide. The 
index of refraction for the guided mode is practically unchanged in the depression, 
but the path length depends on the particular geodesic traversed by the ray; focusing 
occurs because of this differential path length. Since focusing in geodesic lenses 
is a geometrical effect, all waveguide modes are focused with the same focal length. 

Spherical geodesic lenses suffer from spherical aberration which must be cor- 
rected for high-performance uses. This can be done by placing a carefully controlled 
layer of material in the depression, (ref. 2 ), or by using correcting layers external 
to the depression (refs. 3,4). Both these methods remove the mode-independence of 
the focal length. An alternative way to remove spherical aberration is to use an 
aspheric depression (ref. 5). 

Geodesic lenses have several drawbacks. The fabrication techniques are slow and 
costly. Entrance edges must be rounded to prevent excessive losses and scattering 
from an abrupt transition from the planar guide into the depression. This rounding 
tends to lead to an uncertainty in the focal length. Finally, the waveguide in and 
around the depression must be made stronger because the curvature of the depression 
tends to introduce radiation losses in weak guides. This is a particular problem in 
LiNb 03 devices, where crystal size and cost limitations often will necessitate use of 
short focal lengths. 

Diffraction lenses , figure lb, are waveguide analogs to the Fresnel zone plate. 

A diffraction lens is a grating whose pattern approximates the thick hologram formed 
by recording the interference pattern created by the intersection of a converging 
cylindrical wave and a plane wave. The lens consists of an overlayer of lines that 
follow the fringes of the interference pattern. In experimental work on such lenses 
(ref. 6 ), the line pattern was calculated by computer and replicated by e-beam 
lithography to generate a mask which was then used to fabricate the lens 
photolithographically . 

The principal advantages of diffraction lenses are low cost, speed and simplic- 
ity of fabrication, and potential for automation. Their principal disadvantage is 
their extremely limited field of view. 

The Luneburg lens (ref. 7), is one of the classical gradient-index lenses. It 
has found application in microwave antennas and beam formers, but until the advent of 
integrated optics it was impractical in optical systems since, in a bulk optical sys- 
tem, the refractive index at the edge of the lens must equal that of air. A 
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three-dimensional Luneburg lens is conventionally defined as a spherically symmetric 
refractive index distribution that perfectly focuses the surface of a fixed sphere 
onto the surface of a second fixed sphere. In the two-dimensional world of inte- 
grated optics the distribution is circularly symmetric and the spheres are replaced 
by circles. Ordinarily we consider the situation where one sphere becomes of infi- 
nite radius and the other, in the geometrical-optics approximation, shrinks to a 
point; so the lens behaves like a conventional lens, bringing a plane-parallel beam 
to a focus. For specific optical data-processing operations, though, other configu- 
rations, possibly not conforming to the conventional definition, might be desired. 

The practicality of Luneburg lenses in integrated optics results from the cir- 
cumstance that the index of refraction at the lens edge is simply that of the wave- 
guide mode rather than that of air. The required refractive-index distribution can 
be produced in a waveguide by deposition onto the guide surface of an overlayer of 
material with a prescribed variation in thickness, as indicated in figure lc. This 
thickness profile is produced by sputtering or evaporating the lens material onto 
the waveguide surface through a circular mask with shaped edges. Diffraction-limited 
Luneburg lenses (ref. 8 ) have been fabricated using a Ta 205 overlayer on a 7059 
sputtered glass waveguide. 

Any material that is not highly absorptive can be used to make Luneburg lenses, 
but only materials with a refractive index higher than the surface index of the 
waveguide can produce mode-index changes large enough to make short-focal-length 
lenses. For LiNb 03 waveguides, the surface index is in the range 2.1 to 2.3, depend- 
ing on wavelength and polarization. Chalcogenide glasses have refractive indices in 
the range 2.3 to 2.5 and they can readily be deposited by sputtering or evaporation. 
One of the simplest materials to use is amorphous AS 2 S 3 , which is readily available 
in the form of chunks of glass from which material yielding good quality films may be 
evaporated at 500-700°C. We have found this material quite suitable for the forma- 
tion of Luneburg lenses on LiNb 03 . 

It is important that the overlay film be tapered in thickness at the lens edges 
so that spurious modes in the lens region not be excited. These modes occur when 
the required overlayer thickness is sufficient to allow higher modes in the under- 
lying waveguide to exist. As the film thickness is increased, the single mode of the 
waveguide is slowly "pulled" up in effective index, until a point is reached when the 
index rapidly increases beyond the waveguide surface value, toward the bulk index of 
the film. As this rapid change is occur ing, a new mode appears in the waveguide, a 
mode that was cut off for thinner films. This mode replaces the original single mode 
of the guide; its index rapidly increases to the vicinity of that of the original 
guided mode, where it remains until it too is finally pulled up into the film. This 
process continues, with new modes being pulled up into the guide from cutoff to 
replace lower modes that have been pulled into the film. As a result, there is, 
roughly speaking, almost always a mode present having an effective index very near 
that of the original waveguide. If the film thickness changes too rapidly at the 
edge, then in the region where the original mode is lifted into the film, strong 
coupling can occur between the incoming mode and the spurious mode lifted from cut- 
off by the presence of the film. This coupling occurs because the effective index 
difference between the incoming mode and the spurious one is small and is changing 
more slowly than that between the incoming mode and its continuation in the film 
region. The result of such coupling is the appearance of multiple focused spots and 
increased apparent loss of transmission. The increased loss may not be too serious, 
but the transmission of the spurious modes through the lens is deleterious for most 
applications and can be disastrous for processing applications. 
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DESIGN 


The design of the Luneburg lens involves the following steps: 

1. Select the desired operating wavelength, lens aperture and focal length; 
calculate the f /#. 

2. Select the overlay material, waveguide profile, and waveguide parameters. 

3. Select the operating mode and polarization. 

4. Solve the wave propagation equation for the specified conditions and for an 
overlay of arbitrary thickness to find mode indices required for propagation 
both inside and outside the lens region. (As noted by Southwell (ref. 9), 
it is easier to calculate the overlay thickness as a function of the mode 
index) . 

5. For the given f / # evaluate the Luneburg-Morgan integral, which gives the 
mode index required at a given radius in the lens region to obtain proper 
focusing. 

6. Using the results of 4, determine the lens thickness needed to obtain the 
desired mode index. 

7. From the results of 5 and 6, tabulate the required lens profile. 

8. Design- a mask arrangement which will produce a lens having a profile close 
to the specifications of 7. 

The first three steps require little discussion. The lenses discussed in this 
paper are designed for a wavelength, exterior to the guide, of 633 nm. They have 
apertures of 6 mm and design speeds of f/4.3 for full aperture. Since rays passing 
very close to the lens edge are not always well focused, we often design for a 
restricted aperture a little smaller than the lens diameter. The lenses are general- 
ly made on single-mode waveguides, for which we assume a Gaussian index profile with 
a diffusion length of 2 ym. Our design equations are strictly correct only for TE 
modes; but since for highly asymmetric waveguides the only distinction between TE 
and TM modes is in the substrate index, good designs for TM modes can also be ob- 
tained by simply using the appropriate value for that parameter. 

To find the overlay thickness required to obtain a desired mode index (step 4) , 
we have made a straightforward extension of the calculation in reference 9 for a 
uniform waveguide layer to the case of an inhomogeneous guide characterized by a 
Gaussian refractive index profile - that is, n (y) ~ n^ + (n s - n^) where 

n-^ is the bulk index of the lithium niobate, n s is the index at the surface of the 
waveguide layer, A is the diffusion length, and y is depth below the surface into the 
waveguide. We used the WKB approximation (ref. 10) to determine the mode propagation 
characteristics of the guide. Since most of the guided light in the lens region 
propagates in the overlay material rather than in the original waveguide, the exact 
characteristics of the diffused waveguide do not influence the calculated lens pro- 
files greatly; but this is not to say they do not need to be known with some accu- 
racy. As an example, in one calculation a change of 40% in the assumed surface index 
n s led to a change of 10% in the calculated lens thickness at the center. 
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The required mode-index to obtain focusing at a given lens radius is obtained 
(step 5) from the solution of the Luneburg-Morgan integral equation. 


N(r) = 2 
^ext 71 



z) 


1/2 


sxn 


-1 


(u + z) 


(u 2 + 2 z ) 1//2 


du. 


a) 


In this equation, N(r) is the mode index at radius r in the lens region, while N ex p 
is the mode index outside the lens. The parameter t is the reciprocal of twice the 
f /number of the lens, and z = tR, where R = 2r N(r)/A N ex f A is the lens aperture 
In the form presented here, the integral is easily evaluated to 5 decimal place 
accuracy by a single 16-point Gaussian quadrature. 

-Steps 6 and 7 merely involve tabulation and possibly interpolation. 

Given the lens profile required for a particular focusing requirement, we need 
to determine apertures and positions of masks suitable for depositing a lens with 
this profile on the waveguide. As stated earlier, we have found that fairly good 
approximations to profiles of interest to us can be obtained using only two thin 
masks. Our model for the deposition system is a simple generalization of that de- 
scribed in reference 8 , and our present design procedure might be termed computer- 
aided guesswork. 

The model for the evaporation system, and the system itself, are illustrated 
schematically in figure 2 . 


The evaporation source is modeled as a uniform distribution over a circle of 
radius b of point sources, each emitting AS 2 S 3 molecules uniformly into the hemi- 
sphere above the source. The extent to which this simple model describes actual 
evaporation from a crucible has not been investigated experimentally. It is possible 
that the geometric radius b ! of the top of the crucible or the top of the melt 
differs from the effective source radius b that must be used in the model. 


The lens-profiling mask is modeled as a simple circular hole in an infinitesi- 
mally thin sheet parallel to the waveguide surface. The distance hp of this sheet 
from the surface and the radius s of the hole are varied by trial and error to opti- 
mize the fit to the desired profile. The fits are to a lens profile normalized to 
unity at the center; we rely on a separate measurement of the central thickness to 
obtain the proper lens thickness. To obtain a good fit near the edge of the lens, 
an additional mask very close to the substrate is used. It is positioned experimen- 
tally to reduce the lens thickness to zero at the desired aperture. 

Presumably with the use of enough additional masks one can approach the design 
thickness within an arbitrarily small error. More important, though, is the question 
of how small deviations from the ideal profile affect the operating characteristics of 
the lengs. We intend to investigate this in future work. 


FABRICATION AND CHARACTERIZATION 


The AspSp lenses are fabricated in a conventional bell-jar high-vacuum evapora- 
tion system. The system is typically pumped down to a pressure of about 1.0 x 10"^ 
torr by the oil diffusion pump. The source is AS 2 S 3 fused glass which has been hand 
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ground to form a fine powder. The AS 2 S 3 powder is thermally evaporated from a quartz 
crucible held by a conical tungsten basket. The crucible temperature (unmonitored) 
is estimated to be between 500 and 700°C. The AS 2 S 3 molecules travel through the 
series of masks illustrated in figure 2 and are deposited onto the LiNb 03 substrate. 
Evaporation times vary from 15 seconds to 1 minute and are controlled by a hand- 
operated shutter. Film thicknesses were estimated during the evaporation process 
by the weight of the AS 2 S 3 used as the source and the evaporation time. After the 
lenses were removed from the system, their thickness profiles were verified by 
Talysurf. We have recently installed a quartz crystal thickness monitor into 
the system so that we can accurately monitor the evaporated film thickness 
in-situ. Care is taken to remove any fumes generated by the evaporation system 
from the room. 

To characterize the optical quality of the Luneburg lenses we have made, we 
examined the light distribution in the focal plane. The experimental arrangement is 
shown in figure 3. The input beam was directed through a variable-width aperture and 
then coupled into the waveguide with a rutile prism. The guided beam passed through 
the lens and then was coupled out with another rutile prism. The focal plane was 
external to the waveguide and output coupler. An f/2 imaging lens was used to re- 
focus the focal region onto a diode-array scanner. The scanner consists of 128 
photodiodes, each 15 ym wide and 26 ym long, spaced 25 ym apart. The scanner output 
was directed into an oscilloscope where it was displayed as a plot of detected inten- 
sity in the focal plane as a function of position. 


EXAMPLES 


A lens designed to focus the TMq mode of an in-diffused waveguide supporting 
only one mode of each polarization was fabricated. The source was 0.36 g of AS 2 S 3 
glass in an 18.3 mm diameter quartz crucible positioned 100 mm below the waveguide. 
The profiling mask had an aperture of 4.7 mm and was placed 22 mm below the wave- 
guide, while the edge-defining mask, with an aperture of 5.9 mm, was 0.5 mm below the 
guide. A 20 second evaporation yielded a lens with a central thickness of 0.74 ym, 
close to the design value of 0.75 ym. The measured lens profile as determined by 
Talysurf is compared with the design profile in figure 4. Although the lens as fab- 
ricated was about 20% too thick near the edge, it focused a TMq beam only slightly 
smaller than the full aperture with a focal length of 2.5 cm, close to the design 
value of 2.58 cm. The flatness of the fabricated lens relative to the design can be 
alleviated by introducing additional profiling masks. This lens was designed assum- 
ing values of 2.293 for the refractive index at the waveguide surface and 2.40 for 
the AS 2 S 3 film* More recent measurements indicate that the refractive index of the 
as-deposited film at 633 nm is at least 2.42 and may be higher. This will not affect 
the calculated profile greatly, but will have an effect on the design overall 
thickness . 


An example of the focusing qualities of one of these lenses, as determined by 
the focal-spot scanning procedure previously described, is shown in figure 5. This 
lens had a central thickness of 0.67 ym and after exposure to an undetermined amount 
of ultraviolet light had a measured focal length of 2.67 cm. For a one-dimensional 
lens, the half-power diffraction-limited spot size is given by: 


S = 0.8859 


f 


^ext w 


( 2 ) 
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where A 0 is the external wavelength of the light used, N ex t is the mode index outside 
the lens region, f is the lens focal length in the waveguide, and w is the input 
aperture. We used a 4 mm aperture in this experiment; consequently the minimum half- 
power spot size is 1,63 pm. The measured value is 5.4 pm; so at this aperture this 
lens is about three times diffraction-limited. Figure 5 shows that the lens is quite 
symmetrical with little of the focused beam energy outside the central peak. 

The post-fabrication optical shortening of the focal length of the lens by expo- 
sure to blue or ultraviolet light is illustrated by the following two examples. 

The first lens was fabricated by evaporating from an 81.6 mg charge of AS 2 S 3 
glass for 15 seconds. This 8 mm diameter lens had an original TMq focal length of 

12.5 cm. After a 60 mJ exposure to ultraviolet light the focal length decreased to 

10.5 cm. Subsequent exposures of 30 mJ and 60 mJ shortened the focal length to 
9.3 cm and 8.2 cm, respectively. After a total ultraviolet exposure of 540 mJ, 
the TMq focal length had decreased to 5.4 cm. 

The second lens was fabricated by evaporating from an 80.6 mg charge of AS 2 S 3 
glass for 20 seconds. This 8 mm diameter lens had an original TMq focal length of 
5.6 cm. The exposure source was the 4880 A line of an argon laser. An exposure 
of 600 mJ shortened the focal length to 4.5 cm. After a total exposure of 2.4 J, 
the TMq focal length had decreased to 3.8 cm, with no apparent aberrations induced 
by the exposure process. 
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CURRENT AND FUTURE OPTICAL INTEGRATED CIRCUIT 
TRENDS FOR INFORMATION PROCESSING * 

M. K. Barnoski 

TRW Technology Research Center 
El Segundo, CA 90245 

The maturing technology in fiber optics has generated renewed interest in 
the development of integrated optic devices applicable to single mode guided 
wave optical systems for data transmission and data processing. Integrated 
optics represents the development of optical circuits which perform the 
functions of generation, modulation, switching, coupling, and detection of 
optical signals in an integrated, compact and rugged format. Many integrated 
optic devices can be operated at several GHz speed with very low power 
consumption. The batch fabrication processes developed for microelectronics 
are readily adaptable to the production of integrated optical circuits, thus 
reducing the production cost. This presentation will review the status of 
integrated optic components development and the integration of these devices 
for information processing. The development of specific optic circuits, such 
as int-'- rated optic spectrum analyzer, optical correlator, high speed optical 
anal og-to-digital converter, optical switch network, and optical logic circuit, 
will be reviewed. The presentation will also try to delineate the future trend 
of integrated optics for data processing. 

* Because this paper was not available at the time of publication, only the 
abstract is included. 


263 




A BROADBAND RF CONTINUOUSLY VARIABLE TIME DELAY DEVICE 


Frederick W. Freyre 

Research Laboratories, Hazel tine Corporation 


SUMMARY 


A method for implementation of continuously variable time delay of broadband RF 
signals is described. The method uses Bragg Cell and optical heterodyne technology. 
The signal to be delayed is applied to the Bragg Cell acoustic transducer, and the 
delay time is the acoustic transit time from this transducer to the incident light 
beam. By translating the light beam, the delay is varied. Expressions describing 
the Bragg Cell diffraction, lens fourier transformation, and the optical heterodyne 
processes are developed. Included are specifications for the variable delay includ- 
ing bandwidth, range of delay, and insertion loss. Applications include radar 
signal processing, spread spectrum intercept, radar ECM, and adaptive array antenna 
processing . 


INTRODUCTION 


Continuously variable time delay of broadband RF signals is a valuable signal 
processing function. Methods of achieving it include: 

o Digital - The signal is digitized, stored, and converted back to analog. 

This method can yield very long or infinite delays, but is bandwidth limited 
depending on the number of A/D and D/A converters and their speed. The 
length of signal delay is limited by the amount of available memory. Digital 
methods are characterized by large size and power consumption. 

o SAW Dispersive Transducers and Reflective Array Compressors - In one version, 
the signal is applied to a dispersive transducer, then frequency converted, 
and finally applied to another dispersive element which has equal but oppo- 
site slope. The delay depends on the LO frequency used for frequency conver- 
sion. Large insertion loss and difficulty in matching the two dispersive 
slopes are problem areas. 

o Magneto-Static Wave Devices - MSW devices exhibit a dispersive time delay 
characteristic which varies with the applied bias field. Similarly, as with 
the SAW approach, two dispersive elements having equal but opposite slopes 
are used. A delay range of 165 to 180 ns and a bandwidth of 250 MHz have 
been reported (Ref. 1). Difficulties lie in matching the two dispersive 
slopes . 

A new method of achieving broadband variable delay which promises faithful signal 
reproduction;, medium BW, and a wide range of delay uses acous to -optical and optical 
heterodyne technology. The delay is varied through electronic, not mechanical, 
means. Insertion loss is inversely proportional to the square of the laser power. 
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SYSTEM ARCHITECTURE DESCRIPTION 


Figure 1 illustrates the system architecture. Included is a laser, a Bragg Cell, 
a beam translator which can include another Bragg Cell, a Fourier Transform lens, a 
photomixer and some mirrors and beamsplitters. The broadband signal to be delayed is 
applied to the Bragg Cell (Bl) launch transducer. The resulting acoustic signal then 
propagates to a laser beam, which is incident at the cell center frequency Bragg 
Angle. The resulting diffracted light beams are frequency shifted by the acoustic 
frequencies. A beam combiner combines the diffracted beams and a local oscillator 
(LO) beam derived from the same laser. A Fourier Transform lens then focuses the 
combined beam onto a broadband semiconductor photomixer so that optical heterodyning 
occurs. The result is coherent signal reconstruction for those signal frequency 
components that are within the Bragg Cell passband and whose diffracted beams inter- 
acted with the LO. The time delay is essentially the acoustic transit time from the 
launch transducer to the incident light beam. Delays ranging from several nano- 
seconds to several microseconds are feasible. The time delay variation results 
from translating the incident laser beam along the length of the Bragg Cell while 
maintaining Bragg angle incidence. Non-mechanical electronic beam translation may 
be accomplished by using another Bragg Cell placed at the front focal plane of a 
colliminating lens (Figure 2) . 


FOURIER TRANSFORM ANALYSIS 


At Bragg Cell Bl, a family of diffracted light beams is generated since S(t), 
the signal to be delayed, is broadband. These diffracted beams are frequency shifted 
by the respective acoustic frequencies. The direction of frequency shift is up if 
the light is incident on advancing wavefronts, down otherwise. The diffraction angle 
0j) for each beam is nearly proportional to the RF frequency. This angle equals 
0-g 4- A0 (Figure 3) where 



Sin = 2^ f B 
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where 0g = the Bragg angle for the Bragg Cell center frequency and the incident angle 
A = the free space wavelength 
v a = the acoustic velocity 
Af = f - f B 

The acoustic signal is expressed S(t + xq/v a ) exp - [j^c^ + x l/ v a^ where v a is 
the acoustic velocity and Qq the angular RF center frequency. The K vector momentum 
diagram appropriate to figure 3 (if we ignore beamspread angles) is shown in figure 4. 
An approximate expression for the amplitude of the diffracted beam outside of the A/0 
medium is then 

„ ^ xi xi 

Eq = yAd(xi , y ) S (t 4* — ) exp jmt exp - j + ex P “ 3 Ofd * r) (3) 
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where y relates to the amplitude acousto-optic diffraction efficiency and is assumed 
constant over the signal bandwidth, Ad(xi, y) is the diffracted optical beam amplitude 
distribution in the rotated XZ coordinate system (figure 5) , u) is the optical angular 
frequency, and <j> is the acoustic signal phase. 

In this discussion Aj is assumed Gaussian and is expressed 


A d (xt y ) = -j= exp - a 2 (y 2 +xf) (4) 

V TT 

where a = 2/D and D is the optical beam diameter. From figure 5 we have 

xj = -Z sin 0^ + X cos 0^ (5) 

y' = y (6) 

where 0^ is the Bragg diffraction angle outside the medium. From figure 4 we have 

k <3 * r = k(z cos - xi sin 0-j_) + Kx^ (7) 


where k and K are the free space optical and acoustic wavenumbers, respectively. 
The spatial FT of (3) may be expressed: 
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where z = -F 

F = the lens focal length 
u = X 2 /AF 
v = Y 2 /AF 

The transform along the y axis is Gaussian since 

FT(exp-a 2 y 2 ) = 1 (JL exp - (TTv/a)^ (9) 

a 

The transform along the x£ or frequency axis is the key information. This trans- 
form equals the convolution of the transforms of each x^ dependent factor. Recalling 
that K = ft/v a and that the transform of exp j a x is a delta function, the final 
result is : 

FT(E d ) = y exp j [ (u) - 0) t - <J>] exp j k F cos 0^ exp - (^~) 2 * 
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The first part of (10) indicates that at the FT or. lens back focal plan exists a 
frequency shifted (downshifted) light distribution. The next exponential term 
represents a constant phase term and is of no consequence. The v dependent term 
indicates (as already mentioned) that along the y<i axis the distribution is Gaussian. 
The 1/e total width equals 4 AF/ttD. Within the bracket is the convolution of two 
distributions. The first which is the FT of S(t+X]_/v a ) we ignore temporarily. The 
last term shows that along the x£ axis a Gaussian light distribution resulting from 
the RF center frequency exists. Its centroid is located at the X2 = AF^G/2TTv a . By 
inference, all pure sinsusoids in the signal result in frequency shifted Gaussian 
light distributions whose centroid locations vary linearly with RF frequency. Thus, 
the x2 axis is the so-called frequency axis. 

A multiplicative phase term which varies non-linearly with frequency also 
appears immediately after the convolution sign. There will, therefore, be some dis- 
tortion of the RF signal. Over a 100 MHz bandwidth for a 30 mm focal length, the 
phase distortion is less than 2 tt radians and probably not significant. Over a 10 MHz 
bandwidth the distortion is only about 0.1 radians. 

As for FT {S(t+ X|/v a ) }, if S(t) is a slowly varying function over the optical 
beam diameter D, the overall transform expression and the overall transform profile 
remain Gaussian (single frequency transform). It S is rapidly varying, e.g., pulsed 
RF signals whose duration is much less than the acoustic travel time through D, the 
shape of its transform will dominate the overall transform expression. For example, 
if the narrow pulse is rectangular (neglecting transducer and other bandwidth limita- 
tions) then the light distribution at the FT plane will tend toward a sine dependence. 

In this analysis it has been assumed that the lens axis bisects the incident 
optical beam at the Bragg Cell, as in Figure 3. This assumption is implicit in (8). 
But for achievement of variable delay, the incident beam is translated along the x 
axis and is, therefore, displaced from the lens axis. This results in an extra 
linear phase term in (10) . The term equals exp - (j 2 s u) where s is the translated 
distance from lens axis, and can be shown to equal exp - (j fl t^) where t^ equals 
the acoustic transit time between where the light intersects the acoustic beam and 
where the lens axis intersects the acoustic beam. 


SIGNAL RECONSTRUCTION 


The photomixer at the lens back focal plane is illuminated by the transformed 
diffracted light and a local oscillator (LO) spot. The LO light has not been fre- 
quency translated by Bragg Cell Bl. The LO beam intersects the FT lens at the same 
location as the diffracted beams because of the action of the 2 beamsplitters and 
2 mirrors. Also, after reflection from the second beamsplitter, the LO is approxi- 
mately parallel to the diffracted beams. This guarantees that at the photomixer, the 
diffracted and LO light propagation directions will be nearly parallel. This 
parallelism is necessary for efficient optical heterodyning. 

In particular, it can be shown that the photomixer current is proportional to 
sine (L sin a/ A) where a equals the misalignment angle between the 2 optical beams and 
L equals the spot width. As an example, if L = 10 microns, then a must be less than 
0.9° so that the sine function is greater than or equal to 0.9. A = 0.63 microns was 
assumed. An analysis which further describes the signal reconstruction at the 
photomixer is given in Ref . 2 . 
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Insertion Loss 


The insertion loss expressed in decibels is the sum of the gains (losses) at 
the acoustic transducer, Bragg Cell, photomixer and beamsplitters. Figure 6 illus- 
trates the system and includes some calculated and observed insertion loss data. The 
beamsplitter losses are included in the Bragg Cell and photomixer interactions. The 
acoustic transducer gain is assumed to equal -6 dB . The gain of the Bragg Cell can 
be expressed as 


„ _ A? M 2 LP LASER 

G 2 “ V 2H 

where M 2 is its A/0 figure of merit, L/H is the ratio of acoustic beam width to 
height and PpASER equals the laser power. 

The gain of the photomixer can be expressed (Ref. 3) as 

„ /He . 2 P LASER Req 

g 3 = C— ) 7 ~ 

J hv 4 m 

where p = the quantum efficiency 
e = the electronic charge 
h = Planck T s constant 
v = the optical frequency 

R e q — the equivalent resistance of the photo detector and output circuit 
1/m = the fraction of the available LO power which overlaps the diffracted 
light 

Figure 6 shows that the insertion loss varies inversely as the square of the 
laser power . 

System Bandwidth 


The bandwidth, assuming it is not limited by the Bragg Cell, is determined by the 
LO distribution width along the frequency axis at the photomixer. Figure 7 illu- 
strates this characteristic. The LO spot width is essentially diffraction limited. 
Therefore, by narrowing the LO beam before it is focused by the FT lens, an increas- 
ingly larger spot may be obtained. This arrangement is not shown in Figure 1. 


The bandwidth equals the ratio of the local oscillator spot width to the fre- 
quency gradient along the frequency axis. For a non-beams teered Bragg Cell trans- 
ducer, assuming normal Bragg diffraction, this gradient is expressed 


Ax 2 
Af 



(13) 


The 1/e Gaussian spot width equals 4 AF/ttD where D equals the local oscillator 
beamwidth. Therefore, the bandwidth is expressed as 


4AF/ttD _ 4_ v a 
FA/v a 7 T D 


(14) 


A typical bandwidth is 8 MHz. Here we assumed v a equals 4000 m/sec and D equals 
0 . 6 mm . 
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Demonstrated Results and Further Goals 


Demonstrated results include: 
o 0.1 to 0.5 microsecond variable delay 
o 5 MHz bandwidth (center frequency = 80 MHz) 
o 70 dB insertion loss. 

The immediate goals are: 
o reduction of insertion loss 
o increase in bandwidth 
o achievement of longer delays. 

A desirable long term goal is implementation using, not discrete components, 
but integrated ones. The benefits of so doing are well known and relate to align- 
ment stability, size, and long term cost. 

Applications 


We close this paper by listing some areas where wideband variable delay is of 
value. These areas include: 

o Adaptive Array Antenna Processing 

o Radar Jamming (Range Gate Pulloff) 

o Communications - Suppress Multipath Effects 

o Spread Spectrum Intercept Receiver - Feature Detection 

o Processing of Radar Signals. 
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MONOLITHICALLY INTEGRATED ACTIVE OPTICAL DEVICES* 


J. Ballantyne, D. K. Wagner, B. Kushner and S. Wojtzcuk 
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Ithaca, NY 14853 


SUMMARY 


We discuss considerations relevant to the monolithic integration of optical 
detectors, lasers, and modulators with high-speed amplifiers. Some design 
considerations for representative subsystems in the GaAs-AlGaAs and GalnAs-InP 
materials systems are described. Results of a detailed numerical design of an 
electro-optical birefringent filter for monolithic integration with a laser diode is 
described, and early experimental results on monolithic integration of broadband 
MESFET amplifiers with photoconductive detectors are reported. 


INTRODUCTION 


A number of workers have been involved in the monolithic integration of ^ 
detectors and lasers with amplifiers for applications in optical communication. 

The work reported here is specifically aimed at circuits and subsystems suitable 
for use in very-high data rate applications. The OPFET class of photoconductive 
detectors in III-V compounds shows picosecond response time coupled with internal 
gain and low noise. In order to realize the potential of these detectors in a 
practical system, monolithic integration with high-speed, low-noise preamplifiers is 
probably a necessity. In addition, there is a need for integration of laser diodes 
with high-speed amplifiers in order that complete opto-electronic systems can be 
constructed. 

Frequency tunable lasers are useful for FM communication, for frequency 
multiplexed systems, and offer outstanding potential for very-high data rate 
transmission. These may be constructed by suitable combinations of gain, 
birefringent and electro-optical components in the laser cavity. ^ >4 

Two materials systems seem especially attractive for monolithic integration of 
opto-electronic subsystems. They are the GaAs-AlGaAs and GalnAs-InP hetero- 
structure systems. The first system is useful because well-characterized opto- 
electronic components and amplifiers have been constructed in this system, and much 
is known about the materials problems which must be overcome to construct 
successful subsystems. The GalnAs-InP heterostructure system, lattice-matched to 
InP substrates, is interesting because of its potential application for longer 
wavelength communications, where optical fibers show minimum dispersion. A 
further reason for interest in this system is the very high electron saturation 


''Portions of this work were supported by NASA Langley Research Center, The 
National Science Foundation, and the Joint Services Electronics Program at Cornell 
University. Facilities of the National Research and Resource Facility for Sub- 
micron Structures and the Materials Science Center at Cornell University were 
employed in this work. 
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velocity, which was shown to be the greatest for any of the GalnAsP quaternaries 
grown lattice-matched to InP. In previous work we showed that the electron 
saturation velocity in GalnAs is nearly double that of GaAs for comparable carrier 
densities at room temperature . ^ This enables one to construct photoconductive 
detectors with higher internal gain, and transistors with higher G and speed in the 
GalnAs system than in the conventional GaAs system. A drawback to the GalnAs-InP 
system is that unsolved materials problems must be overcome to construct successful 
monolithic opto-electronic circuits, and the individual devices are much less 
developed in this materials system, particularly the transistor technology. 


SOME DESIGN CONSIDERATIONS FOR MONOLITHIC OPTO-ELECTRONIC SUBSYSTEMS 


Monolithic integration of optical and amplifying devices requires semi- 
insulating substrates if the well-developed MESFET technology is to be utilized for 
amplifiers. Such substrates are readily available in both GaAs and InP; however, 
problems immediately arise for MESFET type transistors on InP substrates. From 
initial considerations, it might seem that a simple GalnAs, InP heterostructure 
interface would be useful for both optical and electrical confinement, and would be 
suitable for construction of MESFETs. One of the problems with such interfaces 
grown by MBE is shown in Fig. 1 , ^ where it is seen that in an abrupt GalnAs-InP 
interface substantial leakage currents can flow which prevent construction of 
good transistors. In these MBE grown layers the interfacial currents were linked 
with the growth process for the interface, which may result in an InAsP inter- 
facial layer with poor electrical characteristics. These problems were initially 
solved by incorporating A1 into the system to provide an AlInAs interfacial layer 
between the GalnAs active region and the InP substrate. 


While this approach solved problems with leakage current in transistors, it 
raises further problems for laser diodes. A schematic diagram of a proposed 
heterostructure system for monolithically integrated opto-electronic circuits on 
indium phosphide substrates is shown in Fig. 2. The structure indicated 
schematically there has the potential for producing good transistors for amplifiers, 
good optical detectors, and good lasers. The requirements for OPFET type detectors 
are similar to those of the MESFETs, which are that one be able to construct low- 
leakage Schottky barrier gates on active layers which are thin enough to be pinched 
off at a reasonable voltage, and that the structure show no excessive leakage 
currents through interfaces. The structure shown in Figure 2 meets these require- 
ments. The AlInAs layer under the gate serves to make a good Schottky barrier as 
was previously demonstrated . ^ Fig* 2 is schematic in the sense that details of 
the ohmic contacts and Schottky barriers are not shown. Under the contacts, the 
undoped upper layers must either be removed, doped, or consumed in the contact 
alloying process. There are various approaches for doing these steps, the details 
of which are not discussed here. The requirement for pinching off the active FET 
layer at a reasonable gate voltage means that the thickness of the layer must be 
0.2 microns or below for a doping of lOl^cm*"^, or that the doping must be reduced 
for thicker active layers. 


The AlInAs layer between the substrate and active layer serves to reduce the 
leakage currents in the transistor to reasonable values. It also serves to 
provide excellent carrier and optical mode confinement for the laser structure. 
One advantage of this material system for the TJS laser structure shown is that 
the cladding layers can be easily made semi-insulating, providing excellent 
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current confinement in the TJS laser diode. Another advantage of the structure 
shown for the TJS laser is that it incorporates two-dimensional index guiding, 
which should give extremely good mode confinement. The parallel metal stripes 
on the surface serve as a slot waveguide to confine the lasing mode in the direction 
parallel to the surface. We have done extensive calculations and modeling on 
various geometries of these slot waveguides , some of the results of which are 
included in the following section. 

The problem with the AlInAs interlayer is that in cases studied to date, it 
reduces the quality (in terms of luminescence linewidth and efficiency) of the GalnAs 
active layer. ® However other workers have shown that this is not a fatal 
limitation? Substantial materials work remains to be done to solve this problem. 


SLOT WAVEGUIDES FOR MONOLITHIC INTEGRATION 


The slot waveguide configuration shown schematically in Fig. 2 appears to be 
extremely promising for monolithic integration in III-V compounds. The reasons for 
this are the experimental demonstration of stable guiding with very low threshold 
injection lasers in the AlGaAs heterostructure system^, the compatibility with TJS 
lasers as shown in Fig. 2, and the obvious compatibility with electro-optic 
devices employing planar surface electrodes. 

We have done extensive numerical calculations on the properties of slot wave- 
guides in various geometries, and confirm that excellent low-loss waveguides can be 
made in various III-V compound heterostructure systems. These waveguides contain 
as an inherent component the electrode structures necessary for inducing electro- 
optic effects, or pumping TJS lasers. A further advantage of this waveguide 
geometry is that it is compatible with optical pumping, which is convenient for 
fundamental investigations. The basic geometry of the slot waveguide system we 
analyzed is shown in Fig. 3. The long direction of the slot is oriented along a 
<110> direction so that electric fields applied by the electrodes are along along 
<110> directions as required for usable electro-optic devices. The various guide 
geometries we analyzed are shown in Fig. 4. These waveguides are not only useful 
to provide a passive guide for a laser structure, but are also suitable for combining 
an electro-optically active device within a laser cavity, such as a Q-switch 
or a tunable birefringent filter (which will provide a frequency tunable diode 
laser) . 

We previously demonstrated the feasibility for rapid electro-optic frequency 
tuning of diode lasers , ^ and are now constructing a monolithically integrated 
version of this system. While the simple slot waveguide geometry shown as Type I 
can provide good guiding and low enough loss for a laser cavity, additional 
considerations arise when one considers electro-optic devices. (See Fig. 5 for the 
results in the simple system) . Two additional considerations of importance for 
electro-optic devices are achievement of phase matching between TE and TM modes so 
that good conversion efficiency can be obtained on application of an electric 
field, and the requirement for some electro-optic structures that both TE and TM 
modes exhibit low loss. These requirements led to the investigation of the other 
waveguide geometries shown as Types II through VI. The only geometries which meet 
both of these requirements, of good phase matching and low loss for TE and TM modes, 
are Types V and VI, with Type VI being the preferred guide. As shown in Figs. 6 
and 7, both of these systems can provide good phase matching; however, the Type VI 
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guide appears to have relaxed fabrication tolerances as compared to Type V. The 
fabrication tolerances are very sensitive functions of the Aluminum content in the 
cladding layers. Decreasing A1 content relaxes these tolerances. The mode con- 
finement for a representative Type VI guide is shown in Fig. 8 and the electro- 
optic coupling efficiency for TE to TM modes is shown in Fig. 9. As shown in Fig. 5, 
the simpler waveguide types I-IV show a great difference between loss for TE and 
TM modes. Such a structure would be particularly useful for Q-switching a diode 
laser in a monolithically integrated configuration. 


INTEGRATED SUBSYSTEMS 


In Figs. 10 and 11 we show the circuit diagram and photograph of a finished 
circuit of a monolithically integrated OPFET detector and three stage wide band- 
width amplifier. This system has been designed to give a voltage gain of 7, a 
power gain of 250 (over 23 dB) , and a frequency response exceeding 5 GHz. One 
unique feature of this chip is the inclusion of a new type of interdigitated OPFET 
detector shown in Fig. 12. The areal response of this detector is shown in Fig. 13 
which is the photocurrent due to a HeNe laser spot, raster scanned over the 
detector. The low frequency gain measured in this experiment was 14, which is most 
likely a factor of 2 larger than would be measured at high speeds. Such high speed 
measurements are now in progress. In Figs. 14 and 15 we show the circuit diagram 
and layout for an integrated laser /driver-transistor circuit being fabricated in 
the materials structure shown in Fig. 2. The two small FETs connected In series on 
the gate of the driver transistor provide a convenient, area-and-power-ef f icient 
method for setting the bias level on the laser diode. 


CONCLUSIONS 


In this paper we have identified what we consider to be viable approaches for 
the monolithic integration of detectors, lasers, amplifiers, and electro-optic 
components in two material systems which operate in quite different wavelength 
regions. For the first time, we have carried out extensive numerical calculations 
on slot waveguide structures in III-V compounds , and have computed parameters 
which are useful for the design of guides for electro-optic components in the 
systems. The results of these computations give information on the confinement of 
various geometry guides, the attenuation and the electro-optic coupling parameters 
of these guides. Finally, we have fabricated the most complex monolithic integrated 
opto-electronic circuits so far reported in III-V compounds, and have verified 
their performance at low frequency. We expect these circuits to give good operation 
at gigahertz speeds. 
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I ( V) curve of a GalnAs OPFET grown on an InP substrate by MBE . R 
in current at end of trace is due to leakage through the substrate 
epilayer interface. 
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Circuit diagram for an integrated laser-driver circuit being 
constructed in materials systems of Fig, 2. 
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SUMMARY 


The MicroChannel Spatial Light Modulator (MSLM) is a versatile, highly-sensi- 
tive, and optically addressed device that is being developed for real-time optical 
information processing. The MSLM operates by converting an input optical image into 
a charge distribution at the surface of an electro-optic crystal. The charge distri- 
bution generates an electric field which modulates the refractive index of the 
crystal and thereby the phase or intensity of an image readout beam. Prototype 
devices employing 250 pm thick crystals have exhibited a spatial resolution of 
5 cycles/mm at 50% contrast, an exposure sensitivity of 2.2nJ/cm^ and framing rates 
of 40Hz with full modulation depth. The image processing operations that have been 
achieved using the internal processing mode of the MSLM include contrast reversal, 
contrast enhancement, edge enhancement, image addition and subtraction, analog and 
digital intensity thresholding, and binary-level logic operations such as AND, OR, 
EXCLUSIVE OR, and NOR. 


INTRODUCTION 


The MicroChannel Spatial Light Modulator (MSLM) is a relatively new, versatile, 
highly sensitive, and optically addressed device that is being developed for low- 
light-level real-time optical information processing. In addition to real-time 
operation, it can be designed to exhibit long term (months) optical information 
storage simultaneously. 

A variety of sophisticated image processing operations can be achieved using 
the internal processing mode of the MSLM. These include contrast reversal , contrast 
enhancement, edge enhancement, image addition and subtraction, analog and digital 
intensity level thresholding, as well as binary-level logic operations such as AND, 

OR, EXCLUSIVE OR, and NOR. This paper presents an update of its characteristics. 

The MSLM is illustrated in Fig. 1. It consists of a photocathode, a micro- 
channel plate (MCP), a planar acceleration grid (optional), and an electro-optic plate 
in a proximity focussed configuration. The electro-optic plate generally carries a 
high-resistivity dielectric mirror on one side and a transparent conducting electrode 
on the other. The MCP is an array of small ("10 pm) semiconducting-glass pores, each 
functioning as a continuous dynode electron mul tipi ier^ Spatially uniform gains 
(+5% variation) of 10^ with a single MCP or 10^ with two cascaded MCPs (chevron 
configuration) are typical 5 
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In operation, the write light (coherent or incoherent) incident on the photo- 
cathode creates an electron image which is amplified by the MCP and proximity focused 
onto the surface of the electro-optic plate. The resulting surface charge distribution 
creates a spatially varying electric field which modulates the refractive index of 
the electro-optic plate (crystal). Thus the readout light, which makes a double pass 
through the crystal, is spatially phase modulated. 


It is the wide choice of photocathodes and electro-optic materials that gives 
rise to the spectral versatility of the MSLM and permits its characteristics to be 
tailored for specific applications. For example, the wavelength of the write signal 
is limited only by the spectral response of existing photocathodes. No photocathode 
is needed in the extreme ultraviolet and X-ray regions since the MCP responds direct- 
ly to this radiation5>6. The readout light is limited by the spectral transmittance 
of electro-optic materials (the range between the middle ultraviolet and the middle 
i nfrared) . 


Because of its large electron gain and ability to integrate charge, the MSLM is 
capable of achieving the high photon sensitivities of image intensifier tubes. 

Devices employing one MCP are expected to be able to record visible images with inten- 
sities as low as 10 "^ y/ cm Z^ the primary limitation being MCP dark current. 


PRINCIPLES OF OPERATION 


The dynamics of the write and erase processes in the MSLM can be understood 
from a study of the electron current in the gap between the grid and the crystal. 

The gap current density Jg depends on the voltage V g at the crystal surface, the 
secondary emission characteristics of the crystal surface and on the MCP output 
electron energy distribution. 

Figure 2 summarizes the dependence of the gap current on the above-mentioned 
parameters. The parameter S' 1 (V|<) is an effective secondary electron emission 
coefficient (for electrons removed from the crystal) that takes the MCP energy 
distribution and the collection geometry into account. The point E in Figure 2 is 
an equilibrium point, and M is the point at which electrons can be removed from the 
crystal at the fastest rate. If the device is driven into lockout regime by lowering 
V b too rapidly, the crystal surface repels the primary electrons from the MCP and it 
becomes very difficult to alter the state of the device. 

The MSLM is therefore operated so that it is either at the equilibrium point E 
in Fig. 2, or it is moving along one of the trajectories of Fig. 2 toward equilibrium. 
For materials exhibiting the linear electro-optic effect, two write modes are possible. 
In the electron deposition write mode, V 5 initially divides capacitively between the 
crystal and the gap, thereby raising Vg into the region for which 6 '< 1. Then when 
the write light is turned on, electrons are deposited on the crystal, Vg begins to 
fall toward its equilibrium value V e q, and the crystal voltage V x rises. Electron 
accumulation may be terminated when the desired V x is reached by turning off the 
write light. The crystal may be erased via secondary electron emission by flooding 
_the photocathode with light and lowering Vg (by lowering V^) into the region where 
6 1 >1 . In practice, V5 is ramped downward no faster than Vg can be restored to V e g by 
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electron removal, so as to avoid lockout. 

In the secondary electron emission write mode, with the write light on, is 
ramped downward at a rate chosen so that lockout is avoided and the appropriate 
electron distribution is removed from the crystal . The secondary emission write 
process terminates when either the write light is turned off, or Vb reaches its final 
value and Vg returns to its equilibrium value. To erase the information, can be 
increased abruptly and the photocathode floo.ded with light so that the crystal gains 
electrons. 

Optical information that has been written into the MSLM may be read out either 
interferometrically with schlieren optics or with crossed polarizers. The possible 
readout modes depend on the symmetry and cut of the electro-optic crystal. The read- 
out mode that is used in the prototype devices employs the interference between the 
front and back surface reflections from a plane-parallel electro-optic crystal. If 
dielectric mirrors are deposited on both sides of such a plane-parallel crystal to 
form a Fabry-Perot interferometer of finesse F, a complete off-to-on transition can 
be achieved with approximately 1/F times the halfwave surface charge density of the 
crystal . 

The MSLM can be either framed through a discrete sequence of write, read, and 
erase periods or operated in a continuous mode. In the framed mode, the optical 
write signal is integrated and stored in the form of a charge distribution; after the 
modulation is read out, the charge distribution is removed. In the continuous mode 
of operation, which can be achieved with low resistivity crystals, the instantaneous 
modulation is proportional to the write image intensity for all temporal variations 
within the bandwidth of the system. 

CHARACTERISTICS OF PROTOTYPE DEVICES 


The most recent vacuum-demountable prototype devices built at M.I.T. employed 
0.25 mm -thick oblique-cut LiNbOg crystals coated on one side with a dielectric 
mirror and on the other with a transparent indium-tin-oxide electrode. A 25-mm 
diameter MCP was used in these devices, and the metal electrode on the input face 
of the MCP served as the photocathode for the 254 nm write light. 

The spatial resolution of these devices was approximately 5 cycles/mm at 50% 
contrast, and image degradation after several hours of storage time was negligible. 
Figure 3 shows how the spatial resolution of the MSLM depends on device geometry, and 
Fig. 4 shows a sample of some of the image processing operations that have been 
achieved with a vacuum demountable device. The sequence of steps for achieving these 
operations are described in Ref. 1. 

In an earlier vacuum demountable device employing a Z-cut LiNbOg Fabry Perot 
crystal, a framing rate of 40Hz at full modulation depth was achieved. Additionally, 
we have observed an exposure sensitivity of 2.2 nJ/cm^ at 655 nm wavelength in a 
vacuum sealed device employing a bialkali photocathode and a Z-cut LiTaOg crystal 
0.5 mm thick. This device exhibited extremely long image storage times as shown by 
the history of image degradation shown in Fig. 5. 
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COMMENTS 


The resolution characteristics of the MSLM have been improved by reducing the 
thickness of the crystal. Further improvements in resolution are expected in the 
near future. The large class of image processing operations that can be achieved 
by internal processing and the demonstrated potential for kilohertz framing rates are 
expected to make the MSLM useful for a large number of signal processing operations. 
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Figure 2.- Sketch of gap current density in an MSLM as a 

function of gap voltage for constant write light intensity 
I and MCP bias voltage -V . 
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Figure 3.- Calculated normalized charge sensitivity S o (w)/S o (0) 
plotted against spatial frequency oj for an oblique-cut LiNbO 
MSLM with crystal thickness and gap width as parameters. 
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Figure 4.- Sample of some of the image processing operations achieved with the MSLM. 


















OPTICAL DATA LINKS USING LONG WAVELENGTH TECHNOLOGY * 

D. J. Channin 
RCA Laboratories 
Princeton, NJ 08540 

The replacement of short wavelength (0.82 to 0.86 pm) sources by long 
wavelength (1.1 to 1.6 pm) offers opportunities for data links with enhanced 
performance and improved reliability. Performance improvement comes from the 
reduced Rayleigh scattering at long wavelengths and the opportunity to set the 
source wavelength at the fiber dispersion minimum. Reliability comes from two 
factors: (1) the potential reduction in damage inherent in lower photon 

energies and (2) the opportunity to replace lasers by LED's for many high 
performance applications. This paper will focus on the latter approach. 

This paper will give a review of RCA's InGaAsP/InP long wavelength (1.1 to 
1.7 pm) laser diodes, LED's and PIN photodiodes. Performance character!' sties 
of these devices will be presented with emphasis on their capability for data 
link service. At the present time, the use of edge-emitting LED's is favored 
for moderate link lengths such as are found in distributed computation and data 
base management systems. 

Analysis of data link performance using present and projected components 
will be given. Figure 1 indicates the comparative performance of 0.84 pm lase 
diode/Si APD data links and 1.3 pm LED/InGaAsP PIN systems. Effects of 
dispersion are taken into consideration. The conclusion is that for service 
below 300 Mb/s, LED based fiber optic systems can better the performance of 
laser diode technology. 


* Because this paper was not available at the time of publication, only the 
abstract is included. 
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Figure 1.- Transmission characteristics of fiber optic data links 





INFRARED FIBER OPTIC FOCAL PLANE DISPERSERS 


John H. Goebel 

National Aeronautics and Space Administration 
Ames Research Center 


SUMMARY 


Middle and far infrared focal plane detectors are currently less inte- 
grated in their design than are those in the near infrared. The difficulties 
of low temperature processing steps on the ultrahigh purity germaniun detector 
material, which is required in the far infrared, limit the realization of inte- 
grated circuit technology in the near future. As an alternative approach, we 
discuss the utilization of far infrared transmissive fiber optics as a compon- 
ent in the design of integrated far infrared focal plane array utilization. A 
tightly packed bundle of fibers is placed at the focal plane, where an array 
of infrared detectors would normally reside, and then fanned out in two or 
three dimensions to individual detectors. Subsequently, the detectors are 
multiplexied by cryogenic electronics for relay of the data. 

A second possible application is frequency upconversion (v, + v 2 = v 3 ), 

which takes advantage of the non-linear optical index of refraction or certain 
infrared transmissive materials in fiber form. Again, a fiber bundle is uti- 
lized as above, but now a laser of frequency v^ is mixed with the incoming 
radiation of frequency v« within the non-linear fiber material. The sun, v 3 , 
is then detected by near^infrared or visible detectors which are more sensi- 
tive than those available at v ? . Due to the geometrical size limitations of 
detectors such as photomultipliers, the focal-plane dispersal technique is 
advantageous for imaging upconversion. 
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INTRODUCTION 


Currently, real i zations of instriments based upon infrared upconversion 
are limited by the availability of large high quality non-linear crystals. I 
describe here how fiber bundles of the non-linear material can substantial ly 
eliminate the availability problem. Such "pixelized" crystals, i.e., the 
fiber bundles, can considerably increase the conversion quantum efficiency 
available at low pump power densities. The integration of optical components 
thru the use of fiber optics can considerably add to the ruggedness and use- 
fulness of instruments based on their incorporation when compared with those 
utilizing discrete components. 


DISCUSSION 
Linear Fiber Optics 


Infrared focal planes are now in a phase of extreme integration pressure. 
Users are not satisfied with a few good detectors which scan a scene to there- 
by construct a picture. Rather the availability of microcircuit technology 
has created a large demand for CCD or CID focal plane arrays which "pixel ize" 
a scene with considerable facility and accuracy. The cost investment to pro- 
duce custom arrays is considerable to the point of being prohibitive in many 
instances. Thus only a few standard array formats have become available, and 
those are in silicon for the visible and HgCdTe for the infrared. 

In certain imaging applications the detector pixel size is not matched 
conveniently to the spatial resolution of the telescope. Several factors 
affect the spacial resolution; most notably they are "seeing", diffraction, and 
optical quality. Any of these can severely degrade the resolution so that 
several detectors measure several spatial resolution elements or vice versa. 
This results in a loss of information. 

In Figure 1 we show a proposal for coupling dissimilar spatial resolution 
scales so that in effect they are equivalent. A conical fiber bundle of 
appropriate length is matched at one end to the spatial resolution scale of 
the focal plane and at the other end to the detector array scale. Note that if 
the detector size is much smaller than the resolution size, care must be exer- 
cised in the cone design so as not to totally retroreflect the light in the 
cone before it exits onto the detectors. 

The scheme outlined in Figure 1 can be used advantageously in the 
visible, near IR, and middle IR, where integrated circuit technology produces 
suitable detector arrays. In the far IR, the silicon and HgCdTe detectors do 
not function. There are, so far, only germanium based detectors which are 
physically sizeable as well as discrete. In this case a solution is also 
possible using fiber optics of appropriate materials and is shown in Figure 2. 
Suitable fiber materials are KRS5 and AgBr. The cone is tied at the focal 
plane end, while the detector end is free as in a broom construction. Indi- 
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vidua 1 detectors are then free to be placed over an appropriate spatial scale 
with due consideration to such ancillary components as integrating 
spheres, preamps, load resistors, and stressing clamp. It should be noted 
that fiber bundles are advantageous in space applications for routing focal 
plane images into compartments shielded from cosmic rays. 

Nonlinear Fiber Optics 


One of the really promising developments in optical materials in the last 
decade is the explosion of our knowledge about the properties and applications 
of non-linear optical crystals. Since non-linear properties were first 
investigated, it has been realized that one of their truly original 
applications is in the conversion of infrared radiation into visible where 
extremely sensitive detectors are available. There have been several success- 
ful instruments built for this purpose, but all suffer from one or another 
limitations imposed by the availability of pump sources or that of the requi- 
site non-linear material. It appears that some of these major problems can be 
alleviated through the introduction of non-linear fiber bundles as proposed 
here. 


First I'll list the basic theoretical description of non-linear upcon- 
version. If photons of two different frequencies, v^ and Vg are simultaneously 
propogating thru a uniaxial crystal with an ordinary index of refraction n° 
and an extraordinary index n , then the crystal can be rotated to a specific 
angle 0 , the mixing angle, such that the laws of conservation of energy and 
momentum for the waves can be simply related as 


v, + v~ = Vo energy conservation 
and J; J 

kj + = k^ momentum conservation 

i.e., a third wave at Vg is produced. Polarization must also be con- 
sidered and this leads to two types of momentum conservation or phase matching. 

For Type 1 mixing both waves have the same ordinary polarization 
n °l v l + n °2 v 2 = n6 3 v 3 Ty P e * 

For Type 2 phasematching, orthogonal polarizations are mixed 

"*1 V 1 + "V? * "Vs 

For uniaxial crystals, the polarization can be expanded in terms of the 
electric field strength E 


P i ~ 2 i ,j,k d ijk E j E k 
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The non-linear interaction is proportional to the product of field of 
strength of the two waves and the crystal's effectiveness as a non-linear 
medium is measured by d . . , . Under certain assumptions about the symmetry of 
the polarization matrix ,e.g. , Kleinman symmetry, the experimental equivalent 
of d.- is d ~ . In practice it is almost impossible to evaluate d... 
theoretically. 7 ^ 


For an instrument, the practical parameter of interest is the quantum 
conversion efficiency T) 

_ 2 


n 


512 ir 5 d ff IJ 2 




sin x 


where x = anc | ^k 


k 3‘ k r k 2 


I. is the laser pimp intensity usually chosen at v~ and 1 is the 
interaction length of the crystal. The term in brackets is related to the 
coherence length of the interaction X which is dimensionless. Of immediate 
practical interest are the important Manley-Rowe relationships which govern 
the power flow from wave to wave 


AP 1 = AP 2 = AP 3 



which essentially state that 100% power conversion of infrared into 
visible photons is possible. Indeed that has been shown to be possible in the 
laboratory. 

Instruments have been demonstrated based upon different non-linear 
crystals to produce infrared upconversion with respectable NEP's and n's. Boyd 
and Townes (1977) have usecLproustite (Ag^As S-) to^make an astronomical 
imager with an NEP = 3x10" u W/\/Hz and an i] = zxlO" . Images were obtained of 
the Sun, Moon, Mercury and VY CMa . Voronin et al (19751 used AgGaSp to 
achieve 40%=t] with a high purity (o{ ( 1 . 06 yu m) < 0.1 cm -1 ) crystal able to 
ut i 1 i ze high 1 aser energy level s. Note that i\ depends only on the pimp power , 
not the signal source power. Hence the great interest in upconversion for low 
level signal detection. Gurski (1973) used a Li 10- crystal to convert 3.4/u.m 
radiation with 100% efficiency. He al so achieved an NEP = lxlO" 15 W//Hz. 

Both Voronin and Boyd were able to achieve imaging, as have other workers. 

The requi rements of high field strengths from the pump necessary to drive 
the non-linear interaction requires focusing the pump and signal source in the 
crystal to the threshold of damage, usualy at 10' s of MW/crn in presently 
available materials. Such high energies are usually available only from 
pulsed lasers of low duty cycle. This situation is a disadvantage in low 
level signal detection and leads to the apparently low n and NEP of Boyd and 
Townes. 
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From the expression for -t| , the easily controlled parameters of an up- 
converter are d 1^, and 1 . While grows 1 inearly with , it grows as the 
square of d ff fna 1. The search for materials with higher L f continues , 
but dramati^'order of magnitude leaps are very unlikely. Present val ues of 
d -- are 1 imited to about 300xl0“ y esu.g Specific appl ication may 1 imit the 
uieful d g ^ to val ues less than 100x10“ y esu. 

Refinement of material quality is more easily achieved than inventing n^w 

materials. Present absorption coefficients are usually greater than 0.1 cm" 1 , 

limiting practical crystal usage to 10 cm lengths. However, because of the 

need to focus in crystals to up the field strength, the useful portion of 

crystal is in practice much less than this figure. The spectral bandpass 4v. 

of an upconverter is determined by the crystal length and is approximately p 

iv. „ = i in cm" 1 , 
bp 

In Figure 3, I propose that fiber bundles of non-linear material can be 
used to alleviate some of the above restrictions on the performance of infra- 
red upconverters. The fibers are able to maintain a given punp intensity I, 
over greater lengths in a focused beam profile. The full length of a fiber 
can be employed usefully to increase n. If in the figure the collimated 
beams of v, and v« are focused on the crystal fiber bundle, then the image is 
preserved throughout the detection system. Longer lengths of fibers employed 
mean increased spectral resolution. Fiber bundles can also be formed from 
selected high quality lengths of fibers. 

At the moment, the most promising material available for production of 
upconversion in fibers is AgGaS^. Most all of its properties are known which 
are essential to production of such fibers. In a collaborative endeavor with 
R. Byer and R. Feigelson of Stanford University, D. Dimiduk, and myself, with 
the support of the Ames Research Center's Directors Discretionary Fund, are 
working to the realization of a fiber optical non-linear upconverter, with the 
goal in mind of achieving single photon counting in the 5-m region of the 
infrared with imaging capability. 
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E-BEAM GENERATED HOLOGRAPHIC MASKS FOR 
OPTICAL VECTOR-MATRIX MULTIPLICATION 4 " 


Steven M. Arnold 

Honeywell Corporate Technology Center 
Bloomington, Minnesota 55420 

Steven K. Case 

Department of Electrical Engineering 
University of Minnesota 
Minneapolis, Minnesota 55455 

SUMMARY 

A proposed optical vector-matrix multiplication scheme encodes the matrix 
elements as a holographic mask consisting of linear diffraction gratings. The 
binary, chrcme-on-glass masks are fabricated by e-beam lithography. This 
approach results in a fairly simple optical system that promises both large 
numerical range and high accuracy. A simple holographic mask has been 
fabricated and tested. 

INTRODUCTION 

There has recently been considerable interest in optical computing since 
it offers very high computation throughput rates for mathematical operations 
amenable to parallel computation. One class of such operations, vector-matrix 
multiplication, can be used for performing discrete Fourier transforms, 
coordinate transformations, pattern classification, and many other 
computations. The general vector-matrix multiplication may be written as: 

y m = n-1 H mn x n (m = 1,2,. . .M) 

One optical approach to performing this computation uses N light sources 
to represent the components x n of the input vector, M detectors to represent 
the components y m of the output vector, and suitable optics to assure that a 
fraction H mn of the light from source x n gets to each detector y m . The 
problem can be suitably scaled so that all parameters fall within acceptable 
positive ranges. Optics to perform the function of the matrix H generally 
will be fixed, while the sources are modulated to represent various input 
vectors x. 

In principle, the performance of this optical computer is dependent on a 
number of considerations involving the optics, detectors and sources. In 
practice, numerical range and accuracy will usually be limited by matrix 
element imperfections, while speed will be limited by the amount of light 
reaching the detectors. For this reason, our work has focused on efficient 
optics to precisely distribute light among the various detectors. 

In most schemes for optical vector-matrix multiplication, the matrix is 
encoded as a rectangular array of apertures or gray tones in a mask. This 
approach, represented in Figure 1, encounters several limitations. A complex 
optical system is required in order to illuminate and receive light from 
specific columns and rows of the matrix mask. Much light is discarded in 


+ This work was sponsored by the Air Force Office of Scientific Research under 
Contract No. F49620-80-C-0029. 
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providing uniform illumination to the mask, with the mask passing only about 
half of what remains. Numerical range and accuracy are limited by the 
space-bandwidth product of the mask (generally less than 10^ with conventional 
plotting techniques). Small matrix elements result in small apertures with 
low relative accuracies. If results differ from those intended, it is 
generally difficult to modify a mask except by starting anew. 

PCGH CONFIGURATION 

Our approach to optical vector-matrix multiplication is based upon an 
e-beam generated diffractive mask which we call a partitioned computer 
generated hologram (PCGH) (see Figure 2). Each of N PCGHs is illuminated by 
collimated light from a single element of the source array and thus represents 
one column of the NxM matrix mask depicted in Figure 1. Each PCGH is 
partitioned into M linear gratings which diffract light to the M detectors. 
The optical power diffracted by a particular grating is made proportional to 
the value of the required matrix element. 

Figure 3 illustrates a PCGH intended to produce 10 equal intensity outputs 
when uniformly illuminated. This PCGH contains 10 equal area gratings, each 
with its own spatial frequency. Facets are arranged symmetrically about the 
center to provide immunity to beam wander. 

The PCGHs are fabricated as binary chrome-on-glass holograms where the 
pattern is delineated via e-beam lithography. A glass plate is first coated 
with a layer of chrome and a layer of e-beam resist. A pattern is exposed in 
the resist by e-beam direct writing and the resist is developed. The 
developed resist then serves as a mask for etching the pattern into the 
chrome . 

Our e-beam PCGH optical vector-matrix multipliction scheme has several 
advantages over the scheme in Figure 1. E-beam lithography offers a higher 
space-bandwidth product, which can translate into greater numerical accuracy. 
Also, the PCGHs are in the Fourier plane of the transform lens with respect to 
the detectors. This means that the only requirement for light to reach a 
particular detector is that it be traveling in the right direction upon 
leaving the PCGH. Therefore, the input modules, consisting of source, 
collimating lens and PCGH, may be located anywhere within the aperture of the 
transform lens. This same immunity to shifts allows a PCGH to be partitioned 
into facets in any manner consistent with dividing up the available light 
amongst the various detectors (providing, of course, that the facets do not 
become too small). Other advantages relate to optical efficiency. All light 
striking the PCGH can be used. Light need not be wasted in achieving uniform 
illumination; non-uniform illumination is acceptable so long as its effects 
are accounted for in the partitioning. Small facets associated with lesser 
outputs can be made physically larger by placing them where PCGH illumination 
is lowest. The various considerations which go into the design of a PCGH are 
discussed in the following sections. 

PCGH DIFFRACTION ANALYSIS 

Each facet of the PCGH contains a linear grating to diffract incident 
light to the appropriate detector. The spatial frequencies of these linear 
gratings are determined by the system geometry. First-order diffracted light 
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from a facet of spatial frequency v will be focused in the detector plane a 
distance vXF from the transform lens axis (Figure l) , where A is the 
wavelength and F the transform lens focal length. Our design is for a 
10-element linear detector array. This requires 10 equally spaced grating 
frequencies. The widest possible detector separation is achieved for grating 
frequencies nAv, where Av is the frequency separation and n = 10, 11, 12, 13, 

14, 15, 16, 17, l8, and 19. Then the unwanted harmonic frequencies from the square 
wave gratings begin at 20Av and will not coincide with the desired outputs. 

MODULAR FACETS 

The matrix values are encoded into the PCGH via grating area modulation. 

The hologram must therefore be divided into facets such that the amount of 
light diffracted by a facet to its detector is proportional to the required 
matrix element. Various considerations lead us to partition the PCGH into 
facets along a square grid. 

Mathematically, the transmittance of a facet can be regarded as the 
product of its aperture and an infinite linear grating. By the Fourier 
convolution theory, the diffraction pattern of this facet is the diffraction 
pattern of its aperture convolved with the delta function from the infinite 
linear grating. In other words, the effect of the linear grating is to shift 
the location of the diffraction pattern of the facet aperture. 

Figure 4 indicates the diffraction pattern due to a square aperture of 
dimension D. The main lobe has a width 2AF/D and contains 81.5% of the energy 
passing through the aperture, which we label as 0 dB. The sidelobes form a 
rectangular array with sidelobe energy diminishing inversely as the square of 
the distance from either axis. The figure indicates the energies (in dB) of 
the various sidelobes relative to the main lobe. It is seen that their 
energies diminish most rapidly along a diagonal. 

Crosstalk between channels depends on how these diffraction patterns 
overlap adjacent detectors. As indicated earlier, the separation of detectors 
isAvAF. As facets are made smaller, their diffraction patterns become 
larger, requiring higher grating frequencies to separate them. For this 
reason, we impose a minimum square facet size, based on detector separation 
and an acceptable level of crosstalk. We size our detector aperture to 
capture only the main lobe of this minimum facet diffraction pattern. A 
larger aperture would capture lesser lobes of the channel of interest, but 
also some greater lobes of adjacent channels, thereby degrading the 
signal-to-noise ratio. Also, zero intensity at the edges of the detector 
aperture eases mechanical tolerances. 

In the diffraction pattern of Figure 4 the square box in the center 
represents the detector aperture. Similar boxes are used to indicate possible 
locations of adjacent detectors. These have been separated diagonally to use 
the more rapid sidelobe decay (implying grating fringes which run diagonally 
within the facets). A separation of 2AF/D in each dimension results in 29 dB 
of crosstalk with sane very bright axial sidelobes just outside the detector 
aperture. A separation of 3AF/D yields a much more comfortable 37 dB 
crosstalk. With this as our choice, the minimum facet size will be D = 3 V2 /av. 
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The above discussion of crosstalk implies that each facet is a minimum 
square facet. This would be a severe constraint on system numerical range. 
In practice, we form the facet for each channel from many minimum sized 
modular subfacets. Therefore, the diffraction pattern for a single channel is 
not that of a single subfacet, but rather the pattern is due to the aperture 
consisting of all subfacets for that channel. Crosstalk can be minimized by 
clustering all required subfacets of a given spatial frequency into one or two 
large facets with minimum perimeter, such as was done in the PCGH of Figure 3. 

To have the greatest flexibility in partitioning the PCGH, we would like 
the modular subfacets to be as small and numerous as possible. Since they can 
be no smaller than D = 3 1/2/ Av (with our 37 dB crosstalk limit), we want a 
large spatial frequency separation Av . However, a large spatial frequency 
separation implies large spatial frequencies and hence small grating periods. 
If the grating period becomes comparable in size to the e-beam spot size, 
considerable grating duty cycle errors with corresponding diffraction 
efficiency errors will result. For these reasons we have elected to start 
with grating frequencies of 60 to 114 lp/rran (measured along either axis) and 
0.5 mm subfacets. This gives us 400 subfacets in a 1 cm x 1 cm PCGH and a 
maximum numerical accuracy of about 20 dB. With Gaussian illumination, the 
effective number of subfacets is extended to more than 5000 since the corner 
facets have about 6 % the intensity of illumination of the central facets. 

TRIMMING METHODS 

To extend our numerical accuracy beyond about 20 dB, it is necessary to 
employ a separate lithography step to adjust the relative amplitudes of the 
outputs. The ability to do this trimming is one of the chief advantages of 
e-beam lithography and the PCGH. There are several possible methods of 
trimming a PCGH once it has been made and tested. The best trimming method is 
to add a negative facet, that is to say, a facet exactly out of phase with the 
existing facet. This is much easier than it might sound: the negative facet 
can be written in space already occupied by the existing positive facet, 
leaving a completely open area. Linewidth and phase problems are overcome 
since, if we write over the whole area, the phase and duty cycle will be 
exactly the complement of what was already there. The easiest and probably 
best way to trim, because it minimizes registration requirements , is to draw a 
square box of the appropriate size somewhere near the middle of the existing 
facet. 


PARTITIONING 

The task of partitioning the PCGH to achieve the correct relative outputs 
is greatly simplified by the decision to adopt module facets. The intensities 
of rectangular facets are easy to compute for either uniform or Gaussian 
illumination. We need only determine which subfacets are to be assigned to 
which channels. We do this by means of an algorithm which maximizes accuracy 
and minimizes crosstalk. Each subfacet is assigned entirely to one channel. 
Trimming is then used to achieve the desired numerical accuracy. With a 1 cm 
x 1 cm PCGH, 0.5 mm subfacets and a Gaussian illunination diminishing to e"2 
at the edges, a subfacet in the corner will receive only 0.00019 of the total 
energy. Therefore, it is possible to obtain as much as 37 dB of numerical 
range, with no channel receiving less than a full subfacet. 
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EXPERIMENTAL RESULTS AND DISCUSSION 


The experimental setup for demonstation and evaluation of PCGHs is 
indicated in Figure 5. Light from a He-Ne laser is passed through a pinhole 
spatial filter to create a point source which is then imaged in the detector 
plane by a lens. The PCGH is kinematically mounted in a micropositioner 
placed immediately after this lens. This mount is designed to allow removal 
for trimming and subsequent replacement of the PCGH without disturbing the 
alignment. The distance from the PCGH to the detector plane is 50 cm. In the 
detector plane, we have a single UDT-455 photodetector mounted on a motorized 
translation stage. One of several square apertures, selectable in size from 1 
to 2 mm, is placed in front of this detector to define and limit its effective 
area. A second identical photodiode directly monitors the laser output. By 
ratioing the signals of these two detectors, we eliminate a number of 
potential noise sources, including laser power fluctuations. The two detector 
signals can be fed into a logarithmic amplifier for strip chart traces and 
rough measurements, or else the signals can be measured directly when the 
greatest accuracy is required. 

Our experimental work thus far has been aimed chiefly at constructing 
experimental apparatus capable of yielding the precise measurements needed for 
verifying the 40 dB numerical accuracy and/or 40 dB numerical range we seek. 
In this regard, we consider the setup as described to be adequate with only 
minor refinements. 

We have fabricated the PCGH of Figure 3. This hologram utilizes 
diagonally separated outputs, compact facets and symmetry about the axis. 
Figure 6 shows the resultant diffraction pattern at the output plane. Since 
the grating fringes are written at 45° relative to the facet boundaries, the 
many on-axis sidelobes from each output are seen to be diagonally separated 
from the adjacent output signals. Figure 7 is an intensity trace through the 
centers of the outputs. The spacing of the outputs was selected to give 37 dB 
worst case crosstalk between adjacent channels. 

In light of our preliminary measurements, the PCGH method of optical 
computing appears capable of high numerical accuracy over a wide numerical 
range . 
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Figure 1.- A general scheme for optical vector-matrix multipliation . 
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Figure 5.- Experimental setup for demonstration and evaluation of PCGHs 


Figure 6.- Diffraction pattern from the PCGH. The undiffracted beam, all 
10 first-order diffracted beams, and the first few second-order beams 
are visible. 
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SUMMARY 


New types of holographic optical elements are demonstrated which combine some of 
the flexibility of computer generated holograms with the large space-bandwidth pro- 
duct and high diffraction efficiency of interferometrically recorded volume phase 
holograms. The optical elements are recorded by subdividing a volume hologram film 
surface into numerous small areas (facets), each of which is individually exposed 
under computer control. Each facet is used to produce a portion of the desired 
final wavefront. Three different optical elements are demonstrated. 


INTRODUCTION 

Holographic optical elements are an attractive alternative to conventional opti- 
cal elements in systems using coherent light. The attributes of holographic optical 
elements include the capability of producing low f- number diffraction limited 
optics, and the ability to generate arbitrary wavefronts. They are also potentially 
easy to replicate and have low weight and compact size. Unfortunately, not all of 
these attributes are shared by all hologram types. 

We could categorize holograms as computer generated, interferometrically 
generated, or generated by a hybrid technique as described in this paper. Computer 
generated holograms (CGH) offer the greatest flexibility in the production of 
arbitrary wavefronts. CGH*s, however, usually have limited space- bandwidth products 
and diffraction efficiency. Volume phase holograms, on the other hand, can have 
nearly 100% diffraction efficiency into a single order and very large space- 
bandwidth products (necessary for low f-number optical elements), but offer limited 
flexibility in the production of arbitrary wavefronts. 

The new type of hologram described here combines the best features of the pre- 
vious hologram types: high efficiency, large space-bandwidth product, and greater 

flexibility in wavefront generation. The principle used to design the new hologram 
is straightforward. While the interference pattern required to record an entire 
"arbitrary wavefront" volume hologram is quite complex, the required pattern within 
a small area on the hologram is relatively simple such that it can be approximated 
by planar, spherical, or cylindrical waves. Thus our new holographic optical ele- 
ment is recorded by individually exposing numerous, small volume hologram facets. 

The wavefronts used to record each facet can be different but are produced by con- 
ventional optical elements which are positioned via computer control. 


t This work supported by the University of Minnesota and by AFOSR/ARO Grant 
DAA29-81-K-0033. 
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WAVEFRONT TRANSFORMATION OPTICAL ELEMENTS 


The first type of optical elements that we describe are used for wavefront 
transformations* For this system we start with a Gaussian intensity profile laser 
beam which we wish to transform into another shape in order to optimally illuminate 
some object (for interferometric testing or materials processing). In this example, 
we assume that we must illuminate the periphery of a hollow, square box. Such a 
transformation requires two holographic optical elements as shown in Fig. 1* The 
first hologram acts to spatially redistribute the input light beam and the second 
hologram produces a desired phase front on the redistributed light so that it can 
propagate toward a distant object. 

The first hologram is recorded by subdividing the hologram surface into 
numerous, small squares (facets) each of which is individually exposed as in Fig. 2. 
The spatial frequency of the grating recorded within each facet is varied by 
suitably adjusting the object beam incidence angle before recording. The grating 
frequency within each facet is chosen such that the input beam in Fig. 1 will be 
suitably deflected to be spatially rearranged when it reaches the plane of Hologram 
#2. Design considerations for the hologram are given in Ref. 1 and will not be 
repeated here. 

The redistributed light from Hologram # 1 and a plane wave are used as the 
recording waves for Hologram #2. In this manner, Hologram #2 can rediffract the 
light arriving from Hologram #1 to produce a final output wave with the desired 
intensity distribution and a planar wavefront. Because both holograms are recorded 
in dichromated gelatin, they have nearly 100% diffraction efficiency so that the 
redistribution process is light efficient. In Fig. 3 we have photographed the light 
pattern in five equally spaced planes to show the gradual redistribution of light 
via our process. 


SPACE-VARIANT LENS 


The multi-facet hologram technique is also useful for the production of custom- 
ized holographic lenses with very large space-bandwidth products. As a demonstra- 
tion, we constructed a space-variant cylindrical lens capable of producing a line 
focus on a spherical surface. The construction of a conventional lens that would 
perform this task would be extremely difficult if not impossible. Without multi- 
facet hologram approach, however, we divide the hologram film into a set of horizon- 
tal zones (facets) and specify the focal length that a cylindrical lens wihin each 
facet would need in order to bring a plane wave to a line focus on the object sur- 
face (see Fig. k ). 

For recording, a thin, stationary, flexible mask containing a horizontal slit 
aperture is again pressed lightly against the film. Exposure is with a planar 
reference wave incident at approxiately 35° with respect to the film normal and a 
cylindrical wave. The cylindrical wave is produced by focusing a plane wave with a 
conventional cylindrical lens. The distance between the film plane and the focal 
plane for the cylindrical wave (i.e., the focal length of the holographic lens 
being recorded) is selected by moving the cylindrical lens relative to the film. 

Both the film and the cylindrical lens are mounted on mini-computer controlled 
translation stages. A facet is exposed via a computer controller shutter. To 
record the next facet, the film is translated by one facet height and the cylindrical 
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lens repositioned to achieve the prescribed focal length for this facet* The pro- 
cess is repeated until all facets are exposed* The final hologram consists of 13 
facets each approximately 19><3mm in size. 

The experimental results in Figs. 5 and 6 show the utility of our system. 
Figure 5 shows the focal pattern obtained when a conventional cylindrical lens is 
used to focus a line on a 5.kcm diameter sphere. The top of the sphere is located 
one focal length from the cylindrical lens. Figure 6 shows the improved results 
when our multi-facet optical element is used. The focal line is now sharp and the 
segmenting barely visible. 


IMAGE FORMATION 


Our last demonstration is in the use of multi- facet holograms for the formation 
of images. The hologram surface is again subdivided into numerous small facets, 
each of which contains a grating of a specified spatial frequency. When illuminated 
by a plane wave as in Fig. 7, the light is diffracted to specific locations in the 
image plane which is located a short distance (20cm) after the hologram plane. The 
image is thus built up of a number of small patches of light. 

As before, each facet is inter ferometrically recorded under computer control. 

The computer translates the film behind a mask so that individual facets can be 
sequentially exposed, and deflects the recording beam to produce the requisite spa- 
tial frequencies. 

3 

This image production method produces results similar to those from kinoforms 
in that all of the incident light is transformed into an image. The present produc- 
tion technique has the advantage, however, that one only needs to control a trans- 
verse spatial frequency instead of a longitudinal phase shift. 

A resultant image from this type of hologram is shown in Fig. 8. Here a l8l 
facet hologram is used to form a simple image. 


CONCLUSION 


We have demonstrated three types of multi-facet holographic optical elements 
which combine optical flexibility, high efficiency, and large space-bandwidth pro- 
ducts. These elements should prove useful for a large number of custom optical 
applications. 
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SEMICONDUCTOR LASER AND DETECTOR BANDWIDTH LIMITATIONS 


IN OPTICAL INFORMATION PROCESSING* 

Sverre T. Eng 

Chalmers University of Technology 
S-412 96 Goteborg, Sweden 

Semiconductor lasers and photodetectors are becoming important in compact 
time-integrating optical signal processing, optical systems performing 
arithmetic operations in parallel on arrays of binary data, fiber-optic data 
buses in hybrid optical/digital image processing systems, fast optical logic 
and detection for optical computers, and high-speed digital to analog 
converters, etc. 

The limitations in bandwidth of the semiconductor laser transmitter as 
well as the photodetection receiver are of utmost importance in evaluating 
advanced real-time optical processing system potentials. Therefore, the 
transmission limitation has been investigated in a general way by PCM 
fiber-optic link studies at the highest possible (5 Gbit/s) bit rate. 

A 5-Gbit/s optical fiber communication system has been built using TJS 
lasers and 3-Gbit/s transmission was obtained with a BH laser. A 500 m 
single-mode fiber was used. The laser modulation was limiting the bit rate. 

The rest of the system was capable of at least 10 Gbit/s. Bit error rates of 
10~9 were measured and proved the feasibility of the system. The transmitter 
consisted of three parts: a multiplexer, a laser driver circuit, and a laser. 

In this experiment channels with a bit rate of 100 Mbit/s were multiplexed to a 
maximum bit rate of 5 Gbit/s. Three channels were used to carry information 


* Because this paper was not available at the time of publication, only the 
abstract is included. 
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and one to transmit the clock signal to the receiver. The 100-Mbit/s input bit 
stream can have a nonreturn-to-zero of return-to-zero data format. The 
receiver consisted of an avalanche photodiode (BPW 28 from Telefunken), a 
MESFET low-noise preamplifier, and a demultiplexer. At a transmission path 
following the receiver, AND gates are mounted with a spacing equal to the 
inverted bit rate. The AND gate has a switching speed in the 10-Gbit/s range 
and consists of GaAs dual -gate MESFET transistors. 

Optical transmission at 8 Gbit/s has been performed earlier with TJS 
lasers (ref. 1). However, 5 Gbit/s may be a better practical choice at the 
present time. 
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A LIQUID CRYSTAL ADAPTIVE LENS 


Stephen T. Kowel* 

Syracuse University 

Department of Electrical and Computer Engineering 

Syracuse, N.Y. 


Dennis S. Cleverly 
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SUMMARY 


Creation of an electronically controlled liquid crystal lens for use as a focus- 
ing mechanism in a multi-element lens system or as an adaptive optical element is 
analyzed. Varying the index of refraction is shown to be equivalent to the shaping 
of a solid refracting material. Basic characteristics of liquid crystals, essential 
for the creation of a lens, are reviewed. The required variation of index of refrac- 
tion is provided by choosing appropriate electrode voltages. The configuration re- 
quired for any incoming polarization is given and its theoretical performance in 
terms of modulation transfer function derived. 


INTRODUCTION 


Lenses play a critical role in almost all optical processing systems. From the 
crude and inexpensive plastic lens to the diffraction-limited lens, their role in the 
optical world is unique. The focusing of a lens is always accomplished by mechanical 
motion of one portion of the lens relative to the desired focal plane or correspond- 
ingly to other elements of the lens. This mechanical movement, to be precise, re- 
quires sophisticated and expensive mechanical hardware. 

This paper addresses the electronically controlled creation of a lens. Since 
the actual focusing action is provided by electrode voltages, the image formation can 
be controlled. Also, since the index of refraction is controlled point-to-point on 
the lens, adaptive performance can be provided as well. 


* Supported by the U.S. Army Night Vision and Electro-Optics Laboratory, Fort 
Belvoir, Virginia, Contract No. DAAK-70-80-C-0053. 
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SYMBOLS 


f 

k 

L 

n(x,y) 


n 

e 


n 

o 


R, 


U (x, y) 


V 


T 


A- 


£ 1 

£ ii 


focal length (meters) 

wave number (meters 

diameter of lens (meters) 

position variable index of refraction 

extraordinary index of refraction 

ordinary index of refraction 

angle variable index of refraction 

radius of curvature (meters) 

field distribution 

threshold voltage (volts) 

cell thickness (meters) 

component of the dielectric tensor perpendicular to the molecular axis 
component of the dielectric tensor parallel to the molecular axis 

LENS CREATION BY INDEX OF REFRACTION VARIATION 


Lenses are classically created by varying the optical path length over an aper- 
ture by radially shaping a refractive medium. The same phase transformation can be 
obtained if the material has parallel faces but a varying index of refraction. 

Consider the geometry shown in figure 1, where a material of width A and vari- 
able index of refraction (ref. 1) 


n(x,y) = n g 


2 

jr 

2Af 


2 

r 


2 

x 


+ y 


(i) 


is centered on the z-axis and lies in the x-y plane. Making the same approximation 
as in a phase analysis of a thin lens (ref. 2), namely, 


1) Light passing through the device suffers only a phase transformation 

2) Light rays are paraxial, that is. 



2 

x + y 



2 


( 2 ) 
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results in a transmission function 


where 



(3) 

(4) 


LIQUID CRYSTAL CHARACTERISTICS 


All crystals under the influence of an electric field or applied stress exhibit 
birefringence (ref, 3). That is, the index of refraction can he varied. In solids, 
however, this effect is small -since only distortions of the indicatrix are made. 
Liquid crystals are unique since the molecules reorient under the influence of an 
applied field, either electric or magnetic (ref. 4). In addition, nematic liquid 
crystals are uniaxial and the optical axis is coincident with the molecular axis. 
Thus, as the molecules reorient in response to an applied field, the entire indica- 
trix is rotated, making large changes in the index of refraction. 

Liquid crystals can be divided into two groups* If the dielectric tensor is 
such that the component along the molecular axis (£jj) is greater than the component 
perpendicular to the axis (£j_) , the crystal is said to be positive. These molecules 
tend to align parallel to an applied field. If the reverse is true, e (| < the 
crystal is said to be negative. In this case, the molecules will align perpendicu- 
larly to an applied field. Both types of liquid crystals are positive uniaxial 
crystals. The extraordinary index of refraction, n e (along the optical axis which is 
coincident with the molecular axis), is greater than the ordinary index of refrac- 
tion, n 0 . 

Figure 2 illustrates the changing of index of refraction with orientation of the 
liquid crystal molecules. In this example, a liquid crystal is shown at the top of 
the figure in the homogeneous state. Correspondingly, the indicatrix is aligned so 
that a wave polarized in the x-direction is influenced by an index of refraction n e . 
At the bottom of the figure, the orientation has been altered to the homeotropic 
state, in which case the same polarized light encounters an index of refraction n Q , 

At an intermediate point, the index of refraction is dependent on 0, the direction of 
the nematic director, and is given by (ref. 1) 


cos^Q ^ sin^Q 




n 


n 


(5) 


Two important points should be noted. First, only the light polarized in the x- 
direction was influenced by a changing index of refraction. Light polarized in the 
y-direction suffered a constant phase delay. Because independent of the molecular 
orientation in the x-z plane, it was influenced by a constant index of refraction, 
n D . The second important point is that the liquid crystal molecules do not align 
uniformly across a cell. As the voltage is increased above threshold (ref. 5), the 
molecules near the center react to the field first. At saturation, all of the mole- 
cules have responded except those tied by boundary conditions at the cell wall. 
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Figure 3 illustrates the index of refraction variation with voltage in one par- 
ticular material. 


LIQUID CRYSTAL LENS 


Figure 4 shows a single stage of a liquid crystal lens. With voltages appropri- 
ately set on the electrode array, light polarized in the x-direction would be given 
the correct phase transformation. The y-polarized component would suffer a constant 
phase delay. To obtain total polarization capability, there must be a second stage 
with the nematic director having a preferential turn in the y-direction. This stage 
would vary the index of refraction for the y-component and give a constant phase de- 
lay to the component polarized in the x-direction. 

It has been assumed to this point that electrodes could be configured to give 
the required radial variation in index of refraction. Since the voltage variation 
desired is radial, it would appear that radially symmetric electrodes would be ideal. 
However, radially symmetric electrodes require difficult mask fabrication and connec- 
tions made to them create obscurations. Row and column addressable, rectangular grid 
electrodes like those used in commercially available liquid crystal displays could be 
considered. In this case, addressing individual electrodes is easy and mask fabrica- 
tion is a simple process, but the device inherently does not match the symmetry of a 
lens. Thus, the structure would cause aberrations. 

A simple structure can be implemented by using crossed linear arrays in tandem. 
Consider a square aperture of width L centered at the origin. Let the transmission 
of the aperture be given by 

t(x,y) = rect (~) rect (^-) exp ( jkAn(x) ) (6) 

L L 


where 


k - wave number 
A = thickness of the material 


and 

2 

n(x) = n - *Trr = index of refraction 
e Z At 

Just past the aperture, if it is illuminated by a unit-amplitude, normally incident 
plane wave, the field distribution U* is 

2 

U'(x,y) = rect (^) rect (^) exp ( jkA(n g - ) (7) 

Ignoring the constant phase term, 

2 

U’(x,y) = rect (^) rect (^) exp ( -- ^ f - ) (8) 
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Now consider placing another transmission factor directly in front of the aperture 
with 


2 

n(y) ' "e “ 2M 

then 

U' = (x,y) t 2 (x,y) 


(9) 

( 10 ) 


U' - rect (2) rect (f) exp ( zV&L +J LI ) (11) 

the ideal form of a lens is obtained with linear electrodes. By cascading two 
stages not only is the mask fabrication process simplified, the required symmetry is 
also maintained. 

Figure 5 shows a single stage cross section and a representative electrode 
structure. Note that the ground electrode is uniform. Also, the variation in volt- 
age profile can be used to generate either a change in index of refraction for x or 
y polarized light. If the preferential direction of the liquid crystal director is 
in the x-direction, only the component of the light polarized in that direction will 
be affected by a varying index of refraction. 

To obtain a complete lens, four stages are required. "Complete” means that any 
incoming polarization is given the appropriate phase transform of a thin lens. Fig- 
ure 6 illustrates the functions of the four stages. The first stage gives the ap- 
propriate variation in index of refraction in the x-direction for light polarized in 
the x-direction. Similarly, the second stage varies the index of refraction in the 
y-direction but still only for light polarized in the x-direction. Thus, if a lens 
was to be created for x-polarized light only, the first two stages would be suffi- 
cient. The last two stages repeat the process for the component polarized in the 
y-direction. Figure 7 shows the cross section of the four-stage liquid crystal 
lens . 


OPTICAL PERFORMANCE ANALYSIS 


Due to the electrode structure used to create the index of refraction variation 
in the liquid crystal, a diffraction-limited lens will not be formed. The wavefront 
leaving the lens will have phase distortions caused by the approximation of a 
smoothly varying index of refraction by a sampled function. Thus, the liquid crys- 
tal lens will inherently have aberrations. However, the smaller the electrodes and 
the spacing between them, the better the index of refraction is matched to a smooth 
curve and therefore the more nearly the lens approaches the diffraction limit. Fig- 
ures 8 and 9 show the predicted MTF for a 2-cm wide lens with a 10-meter focal 
length constructed with 501 and 101 electrodes. Spaces between electrodes are 
assumed to be the same size as the electrodes. Thus, fabrication of the 501 elec- 
trode (20-micron width) is easily achievable with today’s microelectronics capa- 
bility. 
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CONCLUSION 


An electronically controlled liquid crystal lens will provide direct control on 
focusing and point-to-point control on index of refraction for potential adaptive 
optical techniques. The lens has all the favorable characteristics of liquid crys- 
tal displays, such as operation at low voltages and low power dissipation. With 
microprocessor control, it could adapt in real time and be electronically calibrated. 
Using current microelectronic technology, near diffraction-limited performance is 
predicted. 
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APPLIED VOLTAGE 


Figure 3.- Index of refraction versus cell voltage using MBBA. 
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Figure 4.- Cross section of liquid crystal lens. 
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ABSTRACT 


Magnetic diffraction grating materials are currently being developed to provide a 
simple means of deflecting light in a two-dimensional , solid-state fashion. The most 
promising material, for several applications, appears to be bismuth substituted iron 
garnet films in epitaxial form. Calculations indicate that deflection efficiency 
greater than 60% is possible in the near-infrared region of the spectrum. Within the 
field of view of the deflector, measurements predict that 101 resolvable spots can be 
expected. Applications include 1) general purpose deflection of free laser light, 
2) image processing of extended sources such as transparencies, 3) programmable 
lensing, and h) fiber optic matrix switching. 


MAGNETO-OPTIC LIGHT DEFLECTOR 

The active component of this deflector is a dynamically alterable, solid-state 
phase diffraction grating that is the energetically favored domain structure of pro- 
perly configured magnetic materials. A description of the deflector follows. 

Diffraction of a light beam occurs as a result of periodic variations in the wave 
amplitude or phase across a wave normal surface. Magnetic stripe domain arrays can 
introduce a periodic 180° phase variation in an incident optical field, through mag- 
netic birefringence. A stripe domain is a long, straight region of uniform width in 
which the magnetization is nearly constant. Typically, the width can vary from .5 
micron to 30 microns or more while the length can extend to several centimeters P- 
Stripe domains in a given sample may or may not have equal width, depending on sample 
properties and on imposed magnetic fields. 

Consider a linear array of stripe domains In a magnetic platelet as depicted in 
figure 1. Adjacent stripes have £ components of magnetization that are antiparallel 
and usually^have a^continuous component in the x y plane. Because of the Faraday 
effect the E x and E components of an incident optical field suffer clockwise rotation 
in odd-numbered stripes and counterclockwise rotation in even-numbered stripes. This 
differential rotation provides an electric component that is orthogonal to^the inci- 
dent polarization and has 180° alternations (parallel and antiparallel to y) that match 
the spatial period of the domains. The process operates uniformly for all incident 
polarizations, including random. 

In the far-field emerging light adds constructively at angles G n , given by 
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n x , 
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sin 



where n is the order number, X the incident wavelength, and -A the grating period. In 
the special case where each stripe has the same width, even orders are suppressed. 
Magnetic apodization at the transitions between even and odd stripes discourages 
higher orders, as well. 

The n^ 1 odd-order power diffraction efficiency for a square phase grating is 
found from 
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2 2 
n tt 


5 -°(x)t sln 2 F ( X ) t 
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with a the optical absorption coefficient, t the material thickness, and F the Fara- 
day rotation. Figure 2 shows the potential total efficiency of several candidate 
crystalline deflector materials in theyisible and infrared region of the spectrum, 
based on reported values for F and a. 

In order to alter the grating in a solid-state dynamic fashion, reliance is made 
on the strong coupling of the material magnetization and applied magnetic fields 
either coplanar with, or perpendicular to, the stripe domains. There are several ways 
in which a magnetic grating can be field programmed to deflect light. A perpendic- 
ular field changes the stripe width, periodicity, or both, causing linear deflection 
and perhaps a shuffling of light amongst the various allowed orders. If the field is 
applied-, in the plane, and collinear with the domains, then the stripe width varies 
with H , increasing the diffraction angle with increases in field intensity. Final- 
ly, if the field is applied to a general direction in the plane, the grating is re- 
established collinear to the field, resulting in azimuthal deflection. For some 
materials, the field strength necessary to cause deflection in the annular field of 
view is less than that required for Lorentz deflection of electron beams in CRTs. 

Thus, microsecond switching speeds are possible with watt level electrical power. 


GARNET 


Of the deflector candidates contrasted in figure 2, rare earth iron garnet is 
presently the preferred material for a variety of reasons. It is readily obtainable 
as epitaxial films up to 100 microns thick grown on gadolinium gallium garnet, com- 
monly employed as magnetic bubble memory substrates. Stable stripe domain arrays of 
sufficient quality to provide 10? resolvable spots in the annular field of view have 
been observed. Curie points to 550 K insure that the magnetization is nearly constant 
over a wide temperature range, including room ambient. The applied fields required to 
manipulate the stripe domains are on the order of 100 oe . , derivable from computer 
controlled Helmholtz pairs or strip lines in close proximity to the crystal. With 
bismuth doping the Faraday rotation can reach 50,000°/cm in the visible and 12,000°/ 
cm in the near- infrared/ The optical absorption exhibits a local minimum at the .81 
micron fiber-optic wavelength and a large window of near-zero absorption at wave- 
lengths greater than 1.2 microns. Calcium doping has been successfully used to re- 
duce the absorption at important wavelengths . ° From experimental data obtained with 
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a Faraday hysteresigraph and a spectrophotometer on thin epitaxial samples it Is 
possible to calculate the maximum deflection efficiency of thick samples or of thin 
samples operated inside an optical resonant cavity. The calculation has been done 
and the results are shown in figure 3- Work is currently under way to experimentally 
evaluate these expectations and to develop the necessary crystal growth facilities to 
exploit the potential of bismuth garnet for light deflector applications. 


APPLICATIONS 


Agile light deflectors based on magneto-optics offer unique solutions to a number 
of optical switching and processing problems. Obvious applications include laser ra- 
dar at 1.06 microns and 10.6 microns, focal plane array scanners, and optical communi- 
cations between moving platforms. Because stripe domain arrays can be plastically 
deformed by spatially varying fields, adaptive optic processing of images of extended 
objects can also be performed. In guided wave communications, bismuth garnet, In its 
present form, serves quite well as a high fanout fiber optic switch. 


Adaptive Optics 


In a stripe domain grating the periodicity, stripe width, and orientation need 
not be constants over the aperture. Spatially varying magnetic fields can be utilized 
to locally modify the deflection of Incident light from pointlike sources or from ex- 
tended sources such as collimated beams or transparencies. By properly tailoring the 
grating, dynamically alterable compressors or expanders and image rotators can be 
generated. 

One -dimensional compressors or expanders are constructed from the domain pattern 
that has the basic spatial modulation seen in figure 4a. Both the stripe density and 
grating orientation are functions of local field. This would be useful for correcting 
image distortions. 

Gratings obtained by converting the linear array of a film with some In plane 
magnetic component, into the radial array of figure 4b, provide a means for continuous 
distortionless image rotation if the grating has the appropriate radial stripe density 
gradient. A geometric optic analysis reveals that the gradient is such that the stripe 
density is inversely proportional to radius. Then the impressed image rotation angle, 
B, is found to be 


B = tan~ ^ s X , 

2c 

with s the distance to the post grating Image plane and c a quantity that depends on 
field magnitude and stripe widths. In addition, the image experiences a magnification 
of sec B. Continuous rotation occurs with intensity changes in the field. 

In order to initialize the grating to a radial mode it is necessary to bring a 
point pole into approximate contact with the film surface. The symmetry in the pole's 
field causes the grating to assume the desired form but with constant stripe density. 
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In practice the pole can he supplied by a polished ferrite needle. The inplane field 
that develops the proper stripe density was found to match the tangential field pro- 
duced by an extended magnetic polepiece. Thus, it has been possible to observe both 
the rotation and the scaling properties of the radial domain image processor. Rota- 
tions to near 90° were recorded. Since the required polepiece is available in ferrite 
form ; high speed rotation is expected. 

Arrays formed from domains that are concentric annuli about a fixed center may 
act like a Fresnel zone plate ; i . e . , collimated light focuses on the zone axis when 
the domain widths satisfy the pertinent Fresnel relations. If the incident light im- 
pinges on just a sector of this array as in figure 4c, focusing occurs off the optic 
axis. In either case, dynamic control of the domains implies dynamic lensing. Most 
likely, garnet materials for this application are of the bubble type because they 
have only a perpendicular magnetization component. 


Fiber Optic Switch 


Useful deflection efficiency at the present fiber optic wavelengths, along with 
magnetic control of the intrinsic grating in garnet films is the basis for a multi- 
port fiber optic switchboard as seen in figure 4* Information-bearing light, propa- 
gating in any or all of the elements of the input fiber 2-dimensional matrix is 
collimated by gradient index lenses (GRIN) . Light from a given fiber lens falls on 
just one deflector element which steers it to one fiber in the focal plane of the 
output lens. The switchboard has high fanout with just a single level of optical 
switching. Greater than 100 X 100 input or output arrays can be accommodated. 

Figure 6 shows a basic version of a fiber optic switch that utilizes only the 
inner circumference of the deflector field of view. Input bus light is tapped by 
selected output fibers that are arranged at the appropriate positions on the 1/0 face 
of a GRIN lens. This version provides a set-and -forget feature; i.e., once the orien- 
tation of the stripes has been established by the field coils to direct input light 
to the selected tap, all coil current can be removed until another tap fiber selec- 
tion is required. 

Meaurements and observations to date indicate that the switch has a number of 
other desirable features. It is insensitive to incident polarization, making it par- 
ticularly attractive for fiber optics. Crosstalk can be held to < - 20 db. with a 
fanout of 10. Physical size can be as low as 1 cm3 as shown in figure 1, a working 
model of a 1 X 3 switch. Field coil switching is approximately 1 watt with micro- 
second select time. As a diffraction grating it satisfies the reciprocity theorem 
and is dispersive, allowing two-way operation of multiwavelength carriers. These 
attributes provide significant systems potential. 

One useful systems device is an optical switch that can access just one output or 
simultaneously access all outputs, on electrical command. It was demonstrated that 
the basic stripe domain fiber optic switch can accomplish this because of its adaptive 
optic capability. With reference to figure 4b, if the stripe grating is switched to 
the constant periodicity radial format discussed under adaptive optics, the deflection 
space is a thin annulus at the same deflection angle as for the linear grating. Since 
the tap fibers are arranged to intercept light at this angle, they are uniformly il- 
luminated with the light propagating in the bus fiber. Thus, all tap fibers are 
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simultaneously selected. This can be implemented by placing a point magnetic pole on 
the optic axis, just behind the mirror. Switching between single select and the 
multiselect star coupler mode should occur in microseconds. 

Because of its high fanout the garnet fiber optic switch may have use as a resi- 
due arithmetic adder for optical computing. Figure 8 is a schematic of a possible 
modulo 5 adder in which input fiber light represents one addend and the grating se- 
lected output represents the other. 


SUMMARY 


Magnetic stripe domain arrays in bismuth iron garnet epitaxial films are alter- 
able in orientation and grating constant with externally applied magnetic fields 
generated by high speed deflection circuitry. Optical radiation, incident upon the 
grating, is thus diffracted in a two-dimensional solid-state fashion. Favorable 
deflection efficiencies and local control of the grating suggest a number of applica- 
tions. Further work may lead to compact devices that are applicable to robotics and 
optical processing. 
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INCIDENT LIGHT 


Figure 1.- Magneto-optic diffraction grating. 
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Figure 2.- Calculated diffraction efficiency of stripe 

domain materials . 










Figure 7<- Operational model of 1 x 3 garnet fiber optic switch 
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EFFICIENT BUTT COUPLING OF HIGH POWER CDH-LOC LASERS 
TO Ti-LiNb0 3 (LNT) OPTICAL WAVEGUIDES* 

J. M. Hammer, D. Botez, C. C. Neil and J. C. Connolly 

RCA Laboratories 
David Sarnoff Research Center 
Princeton, NJ 08540 

We have observed direct butt coupling of over 50 percent of the light from 
CDH-LOC lasers into both Ti indiffused LiNb0 3 optical waveguides. This is the 
highest direct coupling of junction laser light to the technologically 
important class of indiffused Li Nb0 3 type waveguides. Such waveguides are in 
use for a variety of optical switching and processing applications. In 
particular, compact high-frequency spectrum analyzers using Ti-LiNb0 3 are in an 
advanced state of development for military applications. The RCA CDH-LOC 
lasers used in this work have been brought to a high degree of effectiveness. 
These lasers provide single spatial mode output at the highest CW power (40 mW) 
reported. 

Efficient butt coupling requires that the beam waist size of the laser be 
matched to that of the optical waveguide. The CDH-LOC laser is ideal for this 
purpose in that large beam waists commensurate with those of the waveguides are 
available. Thus, by producing lasers with the appropriate beam waist size the 
high coupling efficiency reported here is obtained. 

The experiment consists of measuring the light coupled out of the 
waveguide by a prism coupler, when laser light is coupled into the W.G. The 
coupling is accomplished as follows: the laser junction plane is aligned with 

the waveguide plane and the laser output facet brought into close proximity to 

* Because this paper was not available at the time of publication, only the 
abstract is included. 
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the polished waveguide edge using a high resol ution-six-degree- freedom 
positioner. 

Both the experimental arrangement of the coupler and our observations will 
be described in detail. The outstanding features of the CDH-LOC lasers will 
also be reviewed. 

This work is partially supported by NASA and NRL. 
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